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Abstract 



For the past five seasons, the Angstrom Project, an international microlensing 
collaboration, has been making observations of the central bulge of M31, the An- 
dromeda galaxy, searching for microlensing events. This thesis describes the work 
that has been done to develop an automatic candidate selection pipeline which 
enables lensing candidates to be found even if they are blended with a periodic 
variable baseline, something which has never been attempted before in the same 
way. As a by-product of this process, many variable stars are found and their 
properties are investigated and characterised. The results of the investigations to 
date are presented. The final selection of microlensing candidates selected from 
the most recent Angstrom lightcurve data set is shown, and a separate more 
detailed investigation into one particularly interesting microlensing candidate of 
very short duration is described. 
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Chapter 1 
Introduction 



1.1 Introduction 

In this Chapter, a brief review of the history of microlensing is given, along with a 
summary of the basic physics of the area. Next, descriptions of the major recent 
microlensing surveys are made, along with highlights of their major results. Then 
follows a description of our own Angstrom project, and a statement of the aims 
of this thesis. 

1.1.1 The Geometry of Lensing 

As is illustrated by Figure when a massive object (known as the lens, L) 
comes between a light source (S) such as a galaxy or star and us, the observers 
(O), the light can be bent around the massive object by its gravity. 

This means that the source is no longer seen at S, its true position, but rather 
at Si (and S2, as the light can alternatively be bent around the opposite side of 
the lensing object). In Figure ITTT1 the light paths are drawn as straight lines, but 
of course in reality they would be hyperbolae; however since the angles involved 
are actually very small, use of the small angle approximation is justified and does 



1 
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not cause significant errors in practical situations. The microlensing regime is 
that in which the two (or more, for non point-like lenses) images are unresolved 
by the observation, as opposed to macrolensing, where all images are separately 
observed. 



1.2 The History of Microlensing 



One of the ma ny good reviews of this subject can be found in 



Wambsganssl (119981 ) 



Soldnerl (J1804J) is generally credited with performing the first calculations of the 
deflection of light, or rather a body of velocity c, by gravity, but because he used 
classical Newtonian mechanics his result underestimated the relativistic result 
given in Equation 11.11 by a factor of 2. The possibility of the phenomenon of 



gravitational lensing was first pointed out by iChwolsonl (I1924J ) . but he did not 
perform any calculations. The first seriou s attempt to calculate the magnitude 
of the effect was made by lEinsteinl (119361 ). In this brief paper, which Einstein 



states was only written because he was persuaded to do so by R.W.Mandle, he 
clearly showed a good understanding of many of the principles thought of today 
as relatively modern discoveries, and already was able to calculate a version of 
the equation for the magnification of the source star as either the source moves 
relative to the line joining the observer and lens (or in Einstein's terminology, the 
observer moves relative to the line joining the source and lens stars). This equa- 
tion (see Equation IX . 7j) is usually referred to nowadays as a "Paczyhski curve" 



after 
ing in 



Paczvhski ( 



Zwicky 



1996). Zwicky discussed many aspects of gravitational lens- 



(119371 ) . However, Einstein was of the opinion that it would be a 
very uncommon effect, due mainly to the very small angular separations between 
source and lens that are required, and the possible "dazzling" from the lens star. 
It seems from the paper that he only considered the motion of the Earth, and 
not the possibility of significant motion of the sources. One imagines that he 
had not anticipated advances in technology which allow us to repeatedly monitor 
thousands of stars simultaneously. Hence his famous quote (see the start of this 
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Thesis), which is repeated twice in lEinsteinl (119361 ) in slightly different terms, and 
his apparent opinion that it was not, therefore, of practical interest. However, 
it is my opinion that the "this phenomenon" to which the quote refers is not 
gravitational lensing of one star by another per se, but more precisely the obser- 
vation of the Einstein ring or two resolved images (see Figure fL2|) . for Einstein's 
second reason for his opinion was that the angular radius of the Einstein ring 
(see Equation 11.51 and Figure II .3j) would "defy the resolving power of our instru- 
ments" , (which is actually still true today) and he had already worked out that the 
lensing magni fication effect could be infinite, for a point source (Equation ll.lOp . 



Tikhov! ( 119371 ) calculated the effect of gravitational lensing on the intensities of 
the light from the source in the general case, but his method was apparently not 
easily followed. The first person to make a simpler explanation of the physics and 
geometry involved and to make an esti mate that the passage of one star behind 



another would be a common event was 
the same time as 



Refsdall (11964 ). He published virtually at 



Liebed (119641 ). who discussed similar issues. Refsdal utilised a 



geometrical diagram very similar to Figure [Tj] to derive the general equation for 
the change in brightness of a lensing event (see Equation |1.7p . The first obser- 
vat ion of a gray i tation ally lensed object was of a double quasar, Q0957 + 561, 



by 



Walsh et al. 



(119791 ) . It was later determined that the two main objects are 
two lensed images of a single background quasar by a foreground galaxy. The 
difference in t he light travel time betwe en the two images was measured to be 
1.03 years, by ISchild and Cholfinl ( 119861 ). and this enabled the geometry of the 
system to be completely established. 



Aft er Einstein, microlensing was next explicitly discussed bv lChang and Re fsda 



(1979). They referred to the subject as "star disturbances". Later, 



(1986a 



Paczyhski 



) was the first to coin the term "microlensing", and gave a description of 



the phenomenon: 
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"A real galaxy is made of stars and some continuously distributed mass: inter- 
stellar matter and possibly some exotic particles (dark matter). A detailed struc- 
ture of a macro-image may depend on the masses and surface density of stars 
which split the macro image into a large number of micro-images, separated by 
some micro-arcsec. Even if we cannot resolve this structure, it may affect bright- 
ness of the macro image and contribute to its variations in time". From this 
quote can be seen the explanation for why the phenomenon is commonly known 
as "microlensing", even though the image separation for stellar lensing is more 
typically on the scale of milli-arcseconds. The name was originally coined to 
describe lensing in galaxies at cosmological distances. 



Later that year, 



Kavser et al 



(119861 ) extended this work in several directions, and 
thereby gave the study of microlensin g a firm foundation . Th e first detection of 

diaaaP and 



a mic rolensing; event was reported by 



Irwin et al 



Vanderriest et al. 



(119891 ) where in the gravitationally lensed quasars QSO 2237+0305 and QSO 
0957+561 respectively, the brightness of one of the lensed images changed relative 
to the other in a way inconsistent with a simple time delay. 



1.3 Microlensing Theory 
1.3.1 Basic lensing theory 

The lensing angle a' is given by Equation 11.11 which was calculated by Einstein 
using General Relativity, 



where £ is the impact parameter of the light ray- the separation on the sky at 
the point of closest approach to the lensing mass. Note that this deflection angle 
is only dependent on the lensing mass, and not on any property (e.g. frequency) 
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of the light involved. Assuming this geometry, it can be seen that 

V = D s <j) = D s 6 — D LS a' (1.2) 

In the case of = and a circularly symmetric mass distribution, i.e. when the 
source and the lensing mass are perfectly aligned with the observer's line of sight, 
it is intuitively obvious that positions Si and S2 will be symmetrically arranged 
along a straight line on either side of S. In this case, some simple arithmetic can 
show that 



UGM D LS 

Since this particular situation is rotationally symmetric, the image of the source 
forms a perfect circle around the position of the lensing object. This is called an 
Einstein ring. 9e is called the Einstein angle. Since 



5? (L4) 



this means that 



This radius Re is known as the Einstein radius and is the physical radius of the 
Einstein ring in the lens plane. There are many observational examples known 
of Einstein rings (and, more commonly, incomplete "arcs" , where the alignment 
between source, lens and observer is not perfect). One example of a recently 
discovered Einstein ring is shown in Figure 11.31 The objects causing the lensing 



in situations such as Figure fL3l are galaxies, which may be clearly visible or not, 
and frequently the source objects are galaxies or quasars. 
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Figure 1.1: Diagram showing the geometry of lensing in the small angle approxi- 
mation. The source of lensed light is S, and has unlensed radial position 77 in the 
source plane and R s in the lens plane. The observer is at O. Si and S 2 are the 
apparent positions of the two lensed images in the source plane. L is the massive 
object which is acting as a gravitational lens. Light from the source S is bent by 
an angle a', giving the image S\ a radial position in the lens plane of £. The angle 
corresponds to the angular position on the sky of the source and the angle 6 
corresponds to the angular position on the sky of the image Si. The angle a is 
the angular change in the apparent position of the source caused by lensing. D L 
= distance of lens from observer. D$ = distance of source from observer. Dls = 
distance between lens and source 
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Figure 1.2: In this drawing of gravitational lensing, the lensing mass is indicated 
with a dot at the centre of the Einstein ring, which is marked with a dashed 
line; the sequential source positions are shown with a series of small open circles; 
and the locations and shapes of the two images are shown with a series of dark 
ellipses. At any instant the two images, source and the lens are on a sin gle line, as 



shown in the figure for one particular instant. Figure is Figure 3 from iPaczyhski 



(119961 ) . (Reprinted, with permission, from the Annual Review of Astronomy and 
Astrophysics, Volume 34 1996 by Annual Reviews www.annualreviews.org) 



1.3.2 Microlensing magnification 



As can be seen in Figure fl~2| for a point-like lens, there will be two lensed images, 



one on either side of the lens, one inside and one outside the 



any lensing conserves surface brightness, fjSchneider et al 



inst ein ring. Since 



199J), the ratio of 



image to source intensity is given by the ratio of their areas. With the geometry 



as shown in Figure ll.l[ this ratio can be calculated as 



Ra 



dR +t 
dR, 



u 



2u^(u 2 + A) 



±0.5 



1.6) 
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Figure 1.3: An almost complete Einstein ring found around a Giant Luminous 
Red Galaxy (GLRG). This is a Sloan Digital Sky Survey (SDSS) view of the sky 
composed from g, r & i images around the so-called "Cosmic Horseshoe" . Most of 
the objects in the field are faint ga l axies. The blueish color of the ring is striking. 
Image can be found at IBelokurovl (120081) (reproduced courte sy of V.Belokurov). 
The discovery paper can be found at iBelokurov et ajj (120071 ). 



where u = (R s is the position at which the source would appear in the 
absence of the lens mass and lensing effect) and it is assumed that the source is 
very small compared to Re, i.e. a "point source". The quantity A is called the 
magnification. The total magnification of the two images summed together can 
therefore be calculated as 



> = -£±L= (1.7) 
This equation has been simplified by assuming a point source and point lens. 
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If either of the above are not the case, then the equation becomes much more 
complicated. 

The increase in flux from the baseline value F n in a lensing event is given by the 
formula first written down by 



Einstein 



(119361 ) which is: 



= f[u\t)} where f(x) = 

0c (4 + x) 



- 1 



The two Equations fll.7|) and f ll.Sp are simply related due to Equation 11.91 



A m = (z^i = F_ 1=A _ 1 (L9) 

F F F 

Since the original flux of the lensed object is not able to be measured in the pixel 
lensing regime (see Section 11.3.31 for a definition of this) only the quantity / can 
be discerned by measuring the flux. 

In the case that u — > then Equation 11.71 can be approximated by 



A=- (1.10) 

u 

Taking the limit, u = leads to the surprising conclusion that the magnification is 
infinite, when the source is precisely behind the lens from the point of view of the 
observer. Of course this is only true if the source is a point of zero spatial extent, 
which is unphysical. The magnification is given by the ratio of the summed area 
of the two images divided by the surface area of the source. The reason the 
infinity occurs is because one must formally divide the area of an infinitely thin 
annulus (Einstein ring) by the area of an infinitely small circle, which approaches 
zero faster than the area of the annulus as u — > 0. 

The dimensionless impact parameter u (in units of Re) of lens and source on the 
sky varies as a function of time as the source passes the lens with a minimum 
impact parameter as in Equation 11.111 
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u\t) = (3 2 + 



t E 



A.ll) 



where £g = Rs/vt is the Einstein radius crossing time (the time taken by the 
source to cross the Einstein radius of the lens with a relative transverse velocity 
v t ) and the time of peak flux is given by t . When u from Equation 11.111 is 
substituted i nto Equation 11.71 t he resulting equation is known as the "Paczyhski 



curve", a: 



ter 



Paczyhski 



( 119961 ). but an equivalent form was known already by 
Einstein! (Il936[ ). For large u, the magnification approaches 1, and for u — > 0, 
A oo. For high magnification events, A approximates to vT 1 . This is only 
true for the assumed point source. If the source is finite in transverse extent, the 
divergence at u = becomes "rounded off" and hence A always remains finite. 



It has been shown by 



Gondolol fjl999l ) that the measured full width, half maximum 



timescale of a lensing event ^fwhivi is given by Equation 11.121 



*FWHM = ^W(^) (1.12) 

where oo((3) is given by Equation 11.131 



wtf) = y/2f\f(P)]-F 



1.13) 



and the function / is that in Equation 11.81 



It was shown by iBaltz and Silk! (120001 ) that there exist useful limiting values 
of the function u(/3), which are: for ( 8 <^ — Bv3 and for (f3 3> 1), oj ~ 
/3{y/2 - 1) s . However, Baltz and Silkl fl2000h use a different definition of t E which 
is the same as the one used by the MACHO collaboration, namely t E = 2RE/vt- 

Taking the first of these two limits, for the "high magnification regime" where 
(f3 1), and substituting into Equation (I1.12p results in Equation fl 1 . 14j) in terms 
of our definition of 
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tFWHM = 2V3t E P{P « I) 



;i.u) 



Since when u is small compared to the Einstein radius the magnification may be 
approximated by the inverse of Equation 11.111 substituting for t E using Equation 
11.141 leads to Equation 11.151 



SF(t) 



u(t) 13 



12 



(t-tp) 



tpWHM 



1 

2\ 2 



;i.i5) 



Here the non-degenerate variables have been "reduced" from 4: to, Fq, ft and t E 
to the 3 combinations: t , ^ and £fwhm (or 2\^3(3t E ), and so this equation is 
known as the "reduced Paczyhski Curve". 

Finite Source Effects 



If the angular diameter of the source is comparable to that of the Einstein ring 
of the lens, then the simple Paczyhski curve is no longer valid. The central 
magnification is no longer infinite, and the central region becomes broadened to 
reflect the diameter of the now finite source. The lightcurve for a finite source (but 
still a point lens) becomes the convolution of the Paczyhski curve appropriate to 
each point on the star's surface with the luminosity distribution on the face of the 
star, along the path of the lens as it passes the star. Therefore, to derive the full 
expression, it is necessary to take into account such things as the limb darkening 
of the source star, which necessitates integrating the equations over the whole 
face of the star, with all its luminosity changes. Hence the process may become 



ve ry mathematica 



in 



Witt and Mao 



ly com plex. The full expression and its derivation is de scribed 



19941 1 and in an alternative but complementary way in 



Gould 



(jl994h . 
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Microlensing optical depth 

An important parameter for microlensing is the optical depth, r. It is defined 
as the number of lenses inside a tube along the line of sight with transverse size 
equal to the Einstein radius of the lens. Mathematically, this can be written as 
in Equation 11.161 where p(x) is the density distribution along the line of sight 
and m is the mass of the lenses being considered. D$ is again the distance to the 
source and hence the length of the line of sight being considered. 



When the optical depth is much less than one it is equivalent to the probability 
that a source along the line of sight will be lensed. Of course, in real situations 
the magnitude of the integral of p along the line of sight will vary depending on 
the particular line of sight, meaning that in order to estimate event rates for a 
survey such as Angstrom where the function p varies significantly across the field 
the integral must also be performed over the two transverse directions as well as 
over x. In addition, the sources will vary in luminosity, which may mean that 
not all sources are detectable in real situations with the limitations of physical 
observations. 

1.3.3 Pixel Lensing 

When microlensing is observed in the Milky Way, in the great majority of cases 
it can be expected that the light from the source will be the dominant source of 
light detected in the lightcurve. However, when observations are carried out in 
other galaxies, even in M31 (the nearest galaxy of comparable size to the Milky 
Way) it is certain that in every pixel of a CCD image there will exist many stars, 
whose light is combined. This makes the interpretation of the changes in the flux 
in any one pixel much more complicated, and not only will the light from many 
different stars be present, but several of them may even be varying at the same 
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time. 



When a lensed source is well resolved, it is possible to observe all four of the 
physical parameters of the lensing event; the baseline flux O , the normalised 
impact parameter /3, the time of maximum flux to and the Einstein timescale 
tfi. However, as in the case of Andromeda, where the density of stars in the 
field of view of the telescope is very large and the sources are unresolved, there 
will be many sources in each pixel of the image. Therefore, the baseline flux 
observed before the lensing event takes place is the sum of the flux of all the 
unresolved sources. This means that we do not know the original unlensed flux 
of the particular source which is lensed. 

Equation 11.111 can be re-written as Equation 11.171 if one writes t e ff = fit^- 



If A{t) is the magnification given in Equation ll.7[ which is a function of time 
due to the variation in u in Equation ll.ll[ and (f) is the total observed flux, 
is the summed flux of all the background sources including the source star (in 
the absence of lensing) and 0* is the original (unknown) flux of the source, then 
Equation 11.181 describes the flux variation in the pixels covering a microlensing 
event. 



Since (p^ = 4> re st + 0*, where rest is the total flux of all other sources in the pixel 
other than the lensed source, this can be written 



{u{t)t E f = t eff 2 + {t-t ) 2 



(1.17) 





= <M(0 + <A 



rest 



(1.19) 



The only information about the lensing event that can be gleaned comes from 
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the change <f>*(A(t) — 1) which is observed in the flux, and, experimentally, can 
only be detected if the change in the flux is greater than the statistical error in 
photon counts iV in the pixel, which is y/~N for purely Poisson noise. In other 
words, if N+ is the number of photon counts due to the source, and N ga i is the 
number of photon counts due to galaxy surface brightness, then the criterion for 
detection is that 



{A — l)iV* ~ u~ 1 N* > kN^ al (1.20) 

where k is a constant, often taken as 3. The value of u when the criterion above 
is just met will be referred to as up Therefore, only the so-called "visibility 
timescale", t v , when the lensed flux is visible above the noise, rather than the 
intrinsic Einstein timescale tg of the lensing event, can be directly measured. 
Thus the visibility timescale can be written 



t v = 2{u 2 t - /3 2 ) 1 — = 2{u 2 t - u 2 )h E (1.21) 

where 9e is as defined in equation 11.31 and /i is the relative proper motion of the 
lens across the line of sight. Therefore the visibility timescale depends on the 
square root of the lensing mass. 

The fact that only t v can be observed leads to an ambiguity /degeneracy between 
the true lensing timescale and the true impact parameter (3 (or alternatively the 
source star flux </>*), as, for e.g., a long timescale, low magnification event can 
appear very similar to a short timescale, high magnification event, if different 
values for 6 hn are assumed. 



Gouldl (119961 ) took account of this degeneracy when he approximated the Paczyhski 
curve with one fewer parameter for the special case of high magnification. His 
equation is given in Equation 11.221 
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A<f) 



teff 



(1.22) 



where the three free parameters are: <j> e ff = t e ff and to. 0o is the peak flux 
at the peak time to- It can be seen that this is an exactly equivalent formulation 
to Equation I1.15[ if t e f f is replaced by the correct function of £fwhm found by 
substituting for t e ff in Equation 11.141 

In the same paper, Gould concluded that pixel lensing was potentially a very 
sensitive and widely applicable technique. He showed that there are two distinct 
regimes of pixel lensing, which he labelled "semi-classical" and "spike". The 
boundary between these two regimes is a certain value F max (which is defined 
in the paper) of the original unlensed flux of the source star. Stars which have 
unlensed flux Fq > F max are in the "semi-classical" regime, which means that they 
can be recognised even when they have low impact parameters, and it is usually 
possible to extract information about the timescale and F Q separately from 
these events. On the other hand, for source stars which have unlensed flux Fq < 
Fmax, generally no direct information about the timescale is available. In the 
above paper, various different possible sources of noise and unwanted backgrounds 
which could occur in the process of performing a survey using difference imaging 
in the pixel lensing regime were also discussed. Some possible future applications 
of pixel lensing were given. 



1.4 Microlensing Surveys 



1.4.1 Milky Way surveys 



In 1986, iPaczyhskil (jl986bl ) suggested that a variability search among the mil- 
lions of stars in the Large Magellanic Cloud could be used to detect dark matter 
in the Galactic halo, which the line of sight would pass through. Fortunately, 
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the technology needed to perform such a search became available soon after- 
wards, and the 1986 pa per is credited with in spiring the microlensing searches 
which followed: ERO S flAubourg et all Il993f ). MACHO, which began in 1992 , 



(lAlcock et aL 



and DUO flAlardl . 



1993bh. OGLE, which a 



1996 



Alard et al. 



so started in 



1995 . 



Griest 



9 92. (lUdalski et all Il992l ) 
(119911 ) proposed that ob 



jects responsible for gravitational microlensing be called Massive Astrophysical 
Compact Halo Objects (MACHOs). This term is used to refer to lensing by 
non-luminous objects. 

The name OGLE stands for 'Optical Gravitational Lensing Experiment'. The 
first phase of the OGLE experiment, 'OGLE-I' began in 1992 and observations 
were continued for the next four seasons until the end of the 1995 season, using 
the lm Swope telescope at the Las Campanas Observatory, Chile. Due to lim- 
ited telescope availability, observations were limited to the Galactic bulge, and 
the area covered was relatively small. However, the scientific achievements were 
still considerable and included the first determination o f the microlensing optical 



19941 ) (which was measured as 



depth towards the Galactic bulge (lUdalski et al. 
r = (3.3 ± 1.2) x 1(T 6 . 

EROS stands for 'Experience pour la Recherche d'Objets Sombres', and its main 
aim is research into dark massive objects, otherwise known as 'MACHOs' which 
are gravitationally bound to our Galaxy. This has included observations of the 
Galactic bulge, spiral arms, and both the large and small Magellanic Clouds. 



Early determinations by EROS of the optical depth to the Galactic Centre ( lAfonso et al 



20031 ) gave a value of r = (0.94±0.29) x 10 6 which was not consist ent with the val 



ues c alculated by the MACHO and OGLE groups. Later analysis (IHamadache et al. 



20061 ) with better statistics and a much longer temporal baseline resulted in a 
new value of r = (1.62 ± 0.23) x 10~ 6 e [- a ^ b l- 3 dc ^ with a = (0.43 ± 0.16) deg -1 
which was more consistent with the latest results at that time from MACHO and 
OGLE-II. 



In 



Tisserand et al. 



( 120071 ). optical depth toward the Large Magellanic Cloud was 
calculated to be r < 0.36 x 10~ 7 (95%CL), which would correspond to a halo 
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mass fraction of less than 8%, which disagreed strongly with the MACHO result 
(which is described below). 



The MACHO project (lAlcock et al.Lll993bh is a joint US /Australian collaboration 



whose main stated aim is to test the hypothesis that a significant fraction of 
the matter in our Galaxy (and hence by implication, many other galaxies) is 
composed of low mass, non-luminous objects such as brown dwarfs and planets, 
which could be described as MACHOs. The project first started taking data in 



1993 (Alcock et al. 



1993a ) and began by monitoring 6 million stars in the LMC 



and Galactic bul ge. Results using 5 .7 y ears of obseryations towards the LMC 



were published in 



Becker et al. 



Alcock et al. 



( 120001 ) and in this second 



(119991) and 

paper an optical depth of r = 1.2±g:| x 10~ 7 was calculated using events with 
timescales between 2 and 400 days, based on 13-17 events. This was a significantly 
larger number than the ~ 2 to 4 events expected from lensing by known stellar 
populations, which led t o a predicted MACHO halo fraction of 20%. Using seven 
years of MACHO data, IPopowski et al.l ( 120051 ) were able to use a subset of 42 
out of the total of 62 selected microlensing candidates which had sources which 
were "clump giants" to estimate the optical depth to microlensing towards the 
Galactic bulge as being r = 2.17±g;|| x 10~ 6 at (l,b) = (+1.50 deg,-2.68 deg). 
Giant source stars were chosen for this estimate in order to reduce the problems 
caused by blending of flux from stars other than the true source star. 

Despite the huge amount of effort put into surveys towards both the Milky Way 
and its satellites, the discrepancies between the results of the different major 
collaborations have still not entirely disappeared. When observing towards the 
Galactic Centre, however, it has become clear that the observed differences in 
optical depth are less due to differences between survey methodologies and more 
to do with a clear correlation observed between the measured optical depth and 



the angular positions on the sky of the fields integr ated over. A c 



e ar gra di- 



20051 ) in 



ent in optical depth was observed in MACHO data (IPopowski et al.l . 
the sense that MACHO fields closer to the Galactic Centre have higher optical 
depth. This is clearly consistent with the model predictions of increasing mass 
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density towards the center of galaxies. This gradient has been modelled as a lin- 



ear fit (IPopowski et al 



20051 ) and, when the best fit is removed from data from 



all surveys, results are consistent with one another. The most obvious remaining 
discrepancy between surveys lies in the results of analysis of data from the LMC 
and SMC. As mentioned above, the MACHO pro iect announced the discovery 
of 13-17 microlensing events towards the LMC in lAlcock et al.l ( 120001 ). This is 
significantly more than would be expected from known stellar populations and 
hence they deduce that there must be a significant fraction of dark matter in 



the LMC. EROS, however, only c 



towards both Magellanic Clouds f lTisserand et ah 



aim to have dis c overe d one microlensing event 



2007) . OGLE have pu blished 



only two microlensing events towards the LMC (IWyrzykowski et al.l . 120091 ) . which 
is clearly more consistent with the EROS result than with MACHO. The optical 
depths to microlensing towards the LMC calculated by the EROS and OGLE 
collaborations remain incompatible with that calculated by the MACHO collab- 
oration within published experimental errors. Much effort has already gone into 
attempting to understand where the inconsistency between these results lies, but 
no clear answer has been arrived at. One possible clue may be found in the fact 
that the MACHO fields are more compactly arranged toward the centre of the 
LMC, whereas the EROS fields cover a much larger area. This might have had 
some influence on the relative optical depths expected, but the predictions with 
which the results are compared are arrived at by the different collaborations us- 
ing extensive modelling of their individual circumstances, and should therefore 
also produce consistent answers. The responsibility for further investigation of 
the reasons for the clear discrepancies between the results of different collabora- 
tions towards the Magellanic Clouds will have t o be passed to curre ntly running 



next-generation surveys such as Super MACHO (IStubbs et al.l . 120021 ) . 



Another difficulty, when the statistics are low as they still are in measuring lensing 
towards the LMC and especially the SMC, is how to be certain whether the lens 
that caused a particular event resided in the Milky Way halo or that of the 
Cloud, especially as the two halos may even be continuous. Observations toward 
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the Galactic Centre have to deal with high values of extinction and reddening 
due to dust in the plane of the Galactic disk, which has made making definite 
conclusions about the distribution of matter in the Galaxy much harder. 

The Andromeda Galaxy has several major advantages oy er the Milky Way with 



respect to the difficulties mentioned above (iCrottsl . 119921 ). It lies out of the disk 
plane of our Galaxy, thus significantly reducing the Galactic extinction problem. 
We can explore the inner halo, rather than the outer halo of the Milky Way that 
can be observed in the direction of the LMC, for example. Due to Andromeda 
having a favourable geometry of its own in that it is almost, but not quite, edge 
on to us the modelled lensing probability has a clear variation with respect to 
galactocentric position, meaning that the microlensing origin of events can be 
proved using statistical an alysis. Also, both its gr eater outer rotation speed and 



the Tully-Fisher relation (iTullv and Fisher 



dromeda has a greater mass ( 



Baiesi Pillastrinil . 



1977), for example, imply that An- 



20091 ) , and hence a larger halo than 



the Milky Way, giving it a relatively increased lensing cross section. Of course 
it is also advantageous to be able to observe the halo as a whole from outside, 
rather than being immersed within it. Observations in Andromeda could allow 
resolution of the disagreements over methodology which may have contributed to 
the inconsistent results found in the LMC/SMC observations. Therefore, collab- 
orations began to contemplate making observations towards M31. 



1.4.2 Microlensing Surveys of Andromeda 



The microlensing surveys conducted towards the Andromeda Galaxy began with 



AGAPE, who reported the first microlensing candidate in the direction o: 



detected in 1995, which had a £fwhm of 5.3 ± 0.2 days ( lAnsari et al. 



M31, 



19991 ) 



SLOTT-AGAPE gave a partial analysis o: 
ing images which were stacked nightly in 



their early (1998- 1 999) data, us- 



Calchi Novati et al. 



(120021 ). report- 



ing five candidate events. The analysis of the full data set was concluded in 



Calchi Novati et al. 



( 120031 ) with twice as long a baseline, which resulted in all 
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the five previous candidates being rejected due to additional clear variations in 
the lightcurve being found, but an additional three candidates being put forward. 
These new candidates even the authors stated were "inconclusive". This high- 
lights one of the major issues faced by microlensing surveys in Andromeda which 
is that due to the blending of many stars in the same pixel, some of which are 
likely to be variable, variation of the microlensing baseline due to variable stars 
is a major problem and needs to be seriously considered. Further discussion of 
this issue will be given in Chapter |H WeCaPP, which began in 1 997, described 



their m ethod and gave some sample lightcu rves of variab 



e stars in 



(I200ll ). T he MEGA survey 



NainiTal (IJoshi et al 



(IBelokurov et al. 



Riffeser et al. 



2004 ), followed by th e 



pegan in 1998, fide Jong et all 
20051 ) and POINT- AGAPE Jcalchi Novati et all 120051 ). 
200J) surveys. POINT-AGAPE and MEGA used the same 
data, from the Wide Field Camera on the Isaac Newton telescope, but the two 
surveys used independent data analysis techniques. POINT- AGAPE and MEGA 



have both published results, finding several microlensing candidates. 



The POINT-AGAPE Survey 



The POINT-AGAPE collaboration have so far published two papers giving in 



depen dent and different analys e s of t heir d ata. These were 



(|2005j) and 



Calchi Novati et al. 



Be 



Calchi Novati et al. 



okurov et al. 



(120051 ). 6 mi- 



(120051 ). In 

crolensing candidates were found in their analysis of 3 years of data whose po- 
sitions in M31, taken together with Monte Carlo simulations led them to con- 
clude that they detect e d at least a 20% MACHO contribution to the halo of 



M31. 



Belokurov et al. 



(120051 ) . however, using different cuts, only found three 



top 
that 



evel events. The most imp ortant difference between these two analyses was 



Calchi Novati et al. 



25 days, whereas 



(2005 j) limited t he timescales of their events to less than 



Belokurov et al. 



found fewer events. 



Belokurov et al 



(200 5J) ap plied no limit on timescale, but still 



( 120051 ) did not tend to find short timescale 
events whose lightcurves are contaminated with flux from blended variable stars 
(which seems to be due to the strictness of their first cut, detailed in Chapter 



1.4. Microlensing Surveys 



21 



0J, whereas ICalchi Novati et al.l ( 120051 ) did. More details about some of the cuts 
applied by these two surveys will be made in Chapter H] in relation to the cuts 
applied in the selection procedure used in the work performed for this Thesis. 



The many variab 
ysed in detail by 



e stars which ar e inevitably detected by such surveys were anal- 
et al.l (12004 ) . Knowledge of the spatial distribution of such 
stars has great relevance to the information that can be extracted from surveys 
looking specifically for microlensing events since variable stars are the major 
contaminant for measuring the spatial asymmetry of the microlensing signal pro- 
duced by the high inclination of M31 towards us. The facts that on some lines 
of sight the disk is in front of the densest part of the bulge and in others the 



bulge is in front of the disk, along with the asymmetry on the sky caused 



proposed barred bulge (Ide Vaucouleurs 



1958 



Lindblad 



1956 



Stark 



Dy the 



19771 ). lead 



to likely source and lens populations, and hence the microlensing optical depth, 
varying over the face of the galaxy in a systematic way that can be modelled. It 
has been found that the variable stars share some of this signature, which makes 
extracting an unambiguo us signal f r om m icrolensing candidate spatial distribu- 
tions much more difficult. I An et all J2OO4I ) suggested that one way this could be 
done is to look for an E ast-Wes t asym metry, rather than the near-far asymmetry 
previously proposed in ICrottsI ( 119921 ) , because both the resolved stars and the 



various groups of variables are more or less symmetric about an axis oriented 
North-South. 

The Classical Novae discovered by the POINT- AGAPE survey were reported in 
( 120041 ) . 20 novae were reported, of widely varying speed classes 



Darnlev et al. 



and lightcurve morphologies. These were detected using an automated pipeline 
which minimised the required amount of inspection of the lightcurves by eye. 
This was followed by a more in depth analysis of the nova rate and statistical 



characteristics of the nova population in 



Darnlev et al 



(2006). 
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The MEGA Survey 

MEGA reported 14 microlensing candidates, although they acknowledged that 
their cuts were less stringent than those used by the POINT-AGAPE collabora- 
tion, which may explain the difference in numbers. In addition, th e pixel lensing 



techni ques used by the two teams were different. Specifically, in 



de Jong et al. 



(l2004j ). it is stated that the difference in the number of candidate microlens- 
ing events found by the MEGA and POINT-AGAPE projects was due to the 
"severe cuts" used by POINT-AGAPE, examples of which include the specifica- 
tion that ti/2 < 25 days and the "much higher signal to noise" specified. This 
second claim is not simple to verify, however, as direct comparison of the meth- 
ods used by the different groups is not easy. The MEGA collaboration simply 
specified that Xconst — xjLc > 100, where Xconst i s the \ 2 of the best constant 
it cur ve and xjLc * s the X 2 °f the best reduced Paczyhski 



fit t o the event ri g 
fit. 



of 



de Jong et al. 



(200 4J) were e vidently comparing themselves to the method 



Paulin-Henriksson et al. 



(120031 ) which found 4 candidates and who, in order to 
select high signal to noise candidates, calculated the probability of the bump re- 
lated to the proposed microlensing event being caused by chance, and demanded 
that — ln(P) > 100 in r band and — ln(P) > 20 in at least one other filter. 
The four candidates finally selected, however, had total signal to noise, (defined 

as S/N = Y% FluXl ~e!ron basC ' inc where 60 < S / N < 1600 ' and n is the number of 
points defined as "in the peak"), which was apparently high compared to typi- 
cal events in t he da tabase. Contrary (in some circumstances) to this, however, 



de Jong et al. 



2004 ) used a Xp ac /d.o.f. cut which could be as low as < ~ 1.5 in r 
band and < ~ 2.0 in i band but would usually be looser for higher signal to noise 
events (depending on the total significance, or "signal to noise" as defined above, 
of points in the peak) according to the recipe Xp ac /d-o.f.< 1-5 + 0.1(^p — 1), 
where S/N is as defined above, and n is again the number of points in the peak. 
So, for example, if there were only 5 points in the peak, with S/N contributions of 
(3, 3, 3, 5, 5) (i.e. 3 with 3cr above the baseline and 2 with 5a), then S/N would be 
19, and the Xp ac /d.o.f. limit would be 1.5 + 1.8 = 3.3. (Of course, this formula can 
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break down if the bump goes below the baseline on average, but clearly only pos- 



itive bumps will be selected). Whereas the y 



at a lower level, 



Paulin-Henriksson et al. 



/d.o.f. cut described above began 



(|2003|) used a flat cut of y 2 /d.o .f. < 5 



, and, later, another POINT-AGAPE analysis ( ICalchi Novati et all 120051 ) used 
only Xpac/d-o.f.< 10. The reason that direct comparison is difficult is not just 
because cuts with a particular purpose are constructed differently It is also that 
even if one cut is indeed tighter in one project's analysis, this can be more than 
compensated for by another cut (or cuts) being less strict, meaning that the total 
effect on the num ber of selected candi date events is difficult to predict. However, 
as pointed out in lAlcock et al.l (120001 ) , there is always a strong element of sub- 
jectivity to setting the types and levels of cuts, and every group seems to do this 
in subtly (or unsubtly) different ways. However, if the following (usually Monte 
Carlo) selection efficiency calculations are done properly, the corresponding effi- 
ciencies calculated should compensate for the number of events selected, and the 



resulting optical depth to microlensing r should be the same. 



1.4.3 Development of difference imaging 



Most microlensing surveys in recent years have used a technique called Difference 
Image Analysis (DIA), otherwise called "image subtraction". 



The feasibility of this technique was demonstrate d in 



To manev and Crottsl (119941) 



first proposed for use in microlensing surveys by lTomanev and Crottsl (11996) and 
thereafter utilised in the Columbia/VATT survey, which was the pilot study for 
MEGA. It was first developed for a M31 survey because the crowding problem for 
variable s ources is much gre ater there than in the Milky Way. Later, from 1998 



onwards (Dr ake et al. 



19981 ) it was first employed in Milky Way by the MACHO 



survey of the Milky Way bulge. 

The basis of this method is that if it is possible to scale and subtract two images of 
the same region of sky taken at different times, the resulting image will show only 
the objects whose flux has changed between the two epochs. Since bright non- 
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variable stars will effectively disappear during this process, this method enables 
weakly varying sources to be detected with far greater sensitivity. 



The difference imaging technique was first auto mated by 



Alard and 



m. 



;UPt 



prJ jl998j) 



in their "Optimal Image Subtraction" method flAlard and Luptonl . 119981 ). In this 
method a reference image, which is usually a good image with as small and cir- 
cular point spread function (PSF) as possible, is convolved with a kernel (K) 
which is a function which describes how to transform the shape of the PSF of the 
reference image (R) to that of any other chosen image (/). A smooth function 
is used to model the background galaxy surface brightness (B) in the resulting 
subtracted image and then the residuals to the fit are minimised using the linear 
least-squares method in order to approximate the true difference image. This 
process can be summarised by Equation 11.231 



5> 



Xi j Vi 



mm 



(%2[(R{xi, Vi) ® K{u, v)) - I(xi, Vi ) - B(x h Vi)} 2 } (1.23) 



where x^y^ are the coordinates of each pixel i, (u,v) are coordinates centred on 
the middle of the kernel, and "®" represents a convolution. 



To make their code more computationally efficient lAlard and Luptonl ( 119981 ) de- 
cided to decompose their kernel function into the sum of several Gaussian basis 
functions which are also scaled by a polynomial function of position. This process 
can be summarised by Equation 11.241 



K(u,v)= a m exp(-(u 2 + v 2 )/2a k 2 )u i v j (1.24) 

(k<N,ij)(i<n k ,(i+j)<n k ) 

where K is composed of N Gaussian-like components each with width given by 
<7fc, and the range of the two indices i, j is given by the degree rik of the polynomial 
associated with each component. The scaling factors individual to each 

summed component, so the values of m are implied by the summations over the 
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other indices. 

The least squares fitting is often performed within sub-regions of the image, 
known as "stamps" due to their square shape, which can drastically reduce com- 
puting time. This requires an assumption that the background and PSF do not 
change significantly on the spatial scale of the stamp, which may not always be 
true. 

We are also using this method in the Angs t rom p roject, using a modified version 



of the "ISIS" code of 



Alard and Luptonl (119981 ). We have modified this code 



substantially, extracting the particular subroutines which perform the difference 
imaging and integrating them with the ADAP. This code now uses the standard 
cfitsi/o routines, and is able to utilise image masking, unlike ISIS. The sequence 
of processes performed in the complete Difference Imaging Pipeline is described 
below, in Section 13.3.11 



1.4.4 The study of variable stars from large area surveys 



As described in 



Cook! ( 119961 ). among others, the time series made available by 



the high cadence monitoring performed by microlensing surveys such as MA- 
CHO, OGLE and EROS, have revolutionise d the level of knowledge of yariab 



1993). 



stars in, for example, the Magellanic Clouds (IThe MACHO Collaboration 
Virtually complete catalogues have been constructed for many of the more com- 
mon types of variable star, suc h as RR Lyrae , Ceph eids and long period vari- 



ables. The DIRECT project, in 



Bonanos ct al. 



( 120031 ) used difference imaging to 



look for variable stars in M31, in order to attempt to directly measure the dis- 
tances to this galaxy and also M33. This search covered a much wider area than 
Angstrom, being 22' x 22' and thus did not concentrate on the M31 bulge. In 
Bonanos and Stanekl (120031 ). a difference imaging process was also used, to make 
observations in M83, looking for variable stars, particularly Cepheids. 
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1.5 The Angstrom Project 



1.5.1 Rationale 



The Angstrom (Andromeda Galaxy Stellar Robotic Microlensing) project is using 
five telescopes to conduct a high cadence microlensing survey of the bulge of M31. 
The collaboration is led from the United Kingdom and has members in Korea and 
the United States. M31 is considered a good target for a lensing survey for the 
reasons already given in Section 11.4.11 In addition, the bulge can be covered in a 
single telescope field. The main aim of this survey is to detect microlensing events 
with durations above 1 day due to low mass stars and brown dwarfs in M31. Pre- 
vious M31 surveys have not observed as frequently as we are doing and so were 
not sensitive to these short timescale events and hence to lens masses as low as 
we are sensitive to. We are performing difference analysis in "real time", i.e. 
analysing one day's data in less than a day. With sufficient statistics we should 
be able to provide valuable information abou t the low mass end o f the stellar 
Initial Mass Function (IMF) in Andromeda fjBaltz and Silkl . 120011 ). and about 



the distribution of mass in the bulge. The data will also he 
parameters of the galactic bar in the central regions of M31 



p to constrain the 



1994 



Walterbos and Kennicutt 



Stark and Binney 



19871 ) The most recent investigation of the cen- 



tral regions of M31, a photometric analysis of infr a-red survey data fr om "Two 



Micron All Sky Surv ey" (2MASS) 6X program was 



Beaton et al. 



Beaton et al. 



(120051 ) and then 



( 120071 ). which found that the axis of the "boxy" bulge is oriented at 



approximately 10° to the major axis of th e outer disk, and co nfirmed that M31 



Kerins et al 



(120061 ) . the stellar bar is 



is a barred spiral, like the Milky Way. In 
modelled as a triaxial elliptical distribution with one axis having a greater scale 
length than the other two by a factor 1/0.6. Since this long axis is predicted to 
be misaligned with the major axis of the disk, this should enhance the event rate 
distribution in two of the four qu adrants defined with respect to the disk orien- 



tation. This has been modelled in 



Kerins et al. 



(120061 ) for different combinations 



of disk and bulge models, to produce spatial distributions of the predicted event 
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rate for each, shown in Figure 11.41 The nearly edge-on orientation of Andromeda 
to us (77°) should accentuate any asymmetry between the near and far sides of 
the galaxy in event rate or in the positions of lenses and sources. The far side 
would be predicted to produce more far-disk sources and bulge lenses, whereas 
the near side would be predicted to produce more bulge sources and lenses in 
the near disk due to the varying lines of sight through the galaxy. Due in part 
to the differences in mass/luminosity functions between the bulge and disk, the 
angle at which the galaxy is viewed, and the probable existence and particular 
orientation of a bar, the number of events expected on the far side of the galaxy 
is higher. These effects are enhanced over a given angular field of view, due to 
the foreshortening effect of the almost edge-on galaxy. There is also an additional 
effect due to absorption of the light from the far side of the galaxy by dust, which 
may reduce the number of detected events from the far side, or even change their 
distribution. 



Our calculations flKerins et al. 



20061 1 estimate that approximately 27 events per 
season would be detected by the Angstrom Project over the field of view of the 
BO AO telescope, which is 11 arcmin 2 , if an IMF which is truncated at the hy- 
drogen burning limit is ass umed. (This corres ponds to the "light" or "standard" 



stellar IMF model used in 



Kerins et al. 



( 120061 )). Of these events, about half last 



less than 10 days. However, if the IMF contains a substantial brown dwarf popu- 
lation, as in the "heavy" IMF models, the predicted yield rises to about 64 events 
per season. Of these, about 80% have timescales of less than 10 days. 



1.5.2 Angstrom Telescope Network 

Angstrom has used five telescopes in the Northern Hemisphere, fairly well spaced 
in longitude to provide good time coverage. These are: the Liverpool Telescope 
(henceforth "LT"), on La Palma in the Canary Islands, the Faulkes Telescope 
North (FTN) on Maui in Hawaii, the Hiltner telescope on Kitt Peak in Arizona, 
the Doyak telescope at the Bohyunsan Observatory in Korea and the Maidanak 
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Figure 1.4: The p ixel lensing spatial distributions for galaxy models 1-5 of 
Kerins et al.l (120061 ) . including events with visibility timescales t v = 1-100 days. 
The origins of the plots are the centre of M31 and the pink cross indicates the 
major and minor axes of the M31 disk. The major axis is the line going top left to 
bottom right. Event rate contours are shown in black and are labelled in events 
per year per arcminute 2 . For comparison, the green contour indicates a rate of 
1 in those units. Model 1: "Light" disk, "Light" exponential bulge. Model 2: 
"Heavy" disk, "Heavy" exponential bulge. Model 3: "Light" exponential bulge, 
"Heavy" disk. Model 4: To explore the effect of the luminosity function, this 
model has a disk luminosity function and "Heavy" disk mass function for both 
the bulge and disk populations. Model 5: "Light" disk, power law bulge. 



telescope in Uzbekistan. The LT and FTN telescopes are both 2 metre diameter 
robotic telescopes, the Hiltner is 2.4 m, the Doyak is 1.8 m and the Maidanak 



is 1.5 m. As can be seen from Figure fL5| the telescope locations are reasonably 
evenly spaced around the circumference of the Earth, which gives the best chance 
to get evenly spaced data points several times over the course of a day. 

Data were only obtained from the Maidanak telescope in Uzbekistan during the 
fourth season, but due to the excellent seeing properties of Mt. Maidanak, 
on which the telescope is sited, the data are of high quality. Data from the 
non-robotic telescopes are pre-processed and difference imaging is performed on 



1.5. The Angstrom Project 



29 



Locations of Angstrom telescopes 



Faujkes 
Telescope 
North 

Maui, Hawaii 
2 m, robotic 




Figure 1.5: A Map showing the locations around the Earth's circumference of the 
Angstrom telescopes. 

them in Korea, while data from the robotic (LT and FTN) telescopes are pro- 
cessed in the U.K, where all the data from the different telescopes are combined 
into lightcurves. In addition to new data from the telescopes described above, 
Angstrom has access to the POINT-AGAPE project image data. These have 
been re-processed using the Angstrom data analysis pipeline. The use of these 
data is of great benefit to us as it significantly increases the length of the temporal 
baseline of our data, which in turn helps to distinguish true lensing events, which 
usually do not repeat, from variable stars, which repeat. The fields observed by 
the POINT-AGAPE and MEGA collaborations partially overlap with the fields 
which the Angstrom collaboration are currently observing. The Angstrom fields 
are more centrally concentrated than the POINT-AGAPE fields which have a 
gap in the very centre. However in the areas which do overlap the benefits to the 
analysis of the data are very significant, albeit with sparser time sampling than 
Angstrom. 

A graphical summary of the Angstrom collaboration pipeline is given below in 
Figure 11.61 Already preprocessed data from all five telescopes are sent to Liver- 



1.5. The Angstrom Project 



30 



Table 1.1: Table showing the various sizes of the CCD chips used in the Angstrom 
Survey. 



Telescope 


Mean CCD size (arcmin) 


aspect ratio 


LT, FTN 


4.6 


1 


Doyak 


11 


1 


Maidanak 


18 


1 


Hiltner 


4.5 


1.52 



pool for processing. The stages performed include PSF quality control, removal 
of cosmic rays, defringing, alignment and stacking. Next the difference imaging is 
performed on both the Angstrom and PA data, and variable sources are detected 
in the difference images. The lightcurves of variable objects which are produced 
are then fed into the candidate selection pipeline developed for this thesis, which 
selects the microlensing (and variable star) candidates. The sam e lightcurves 



educate the Angstrom Project Alert System (iDarnley et al.l . I2007al ) (see Section 
ll.5.4p . Any promising alerts can then be distributed to the wider astronomical 
community who may make follow up observations of interesting transient targets. 



1.5.3 Observing Strategy 

As mentioned above, many of the predicted microlensing events have timescales 
of 10 days or less. Therefore we are taking data at the rate of several (~ 2 — 3) 
epochs each night when possible. This should enable us to characterise events 
with a timescale of only a few days. Observations were initially made in multiple 
wavelengths in order to assist with the identification of events. This was because 
microlensing events are achromatic whilst non-microlensing events are likely to 
be chromatic. The LT observed in Sloan i' and r' filters, the FTN in Sloan i' and 
Bessel R. The Doyak telescope is using Johnson- Cousins I and the Hiltner uses 
Gunn-Thuan i. The Maidanak telescope data are also Johnson- Cousins I band. 
The differing sizes of the CCDs used on the Angstrom telescopes are shown in 
Table 11.11 The Maidanak telescope has a 4096x4096 pixel Fairchild CCD camera, 
which gives a pixel size of 0.26". 
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Each exposure in the sequence is kept short (no more than 200 seconds) to avoid 
saturating the M31 bulge. In the pilot season, the total exposure time of each 
epoch was 10 minutes, but in subsequent seasons this was increased to 30 minutes 
in order to increase our sensitivity to microlensing anomalies. Unfortunately, the 
LT r and FTN R band data proved at an early stage to be of lower quality, with 
fewer variable objects visible. For a fixed allocation of telescope time, i' band data 
give a greater sensitivity to variable stars, thus allowing better exclusion of false 
positives than by using a sparser time sampling spread between two bands (for 
e.g. (i' + r) or (i' + R)) Unfortunately this meant that doing a comprehensive 
check as part of the candidate selection process for achromaticity was not always 
possible, but the pipeline was written in such a way that if the data existed, the 
achromaticity condition could be utilised, perhaps in this project in future, or in 
future surveys. 



1.5.4 The Angstrom Project Alert System (APAS) 



As part of the Angstr om Project, a real time alert system has been developed 



(jDarnley et al. 



2007a] ) to enable the alerting of interesting developing events to 



the astronomical community so that they may be followed up with other tele- 
scopes, as has been done for several years by the Milky Way surveys such as 
OGLE and MOA. The successful operation of this kind of system requires the 
Angstrom data to be processed in "real time" . This has been achieved and in the 
observing season 2007 /2008 real alerts were issued on promising looking events 



( Darnlev et al. 



2007bl ). A more detailed description of the operation of the APAS 



will be given in Section 13.21 



1.6 Aims of thesis 

In this Thesis I will describe the Angstrom Project, the work performed by my- 
self as part of the Angstrom Project and present the scientific results which have 
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been produced as a result of this work so far. The main aim of this Thesis is 
to develop a Candidate Selection Pipeline for the Angstrom Project which can 
select, from the totality of variable object lightcurves produced by the ADAP, 
the candidates which are most likely to be caused by microlensing, while simul- 
taneously selecting as few objects as possible which are more likely to be due to 
variable stars, classical novae, supernovae or other non-microlensing phenomena. 
If this primary aim is achieved, then whatever statistical analysis that may be 
possible of any lensing candidates will be performed. For example, it would be 
interesting to examine the spatial distribution of candidates within M31 and the 
distribution of lensing timescales. Any particularly interesting lensing candidates 
will be investigated in more detail. It is expected that a large fraction of the vari- 
able objects found will be due to variable stars, and so any examination of the 
statistical properties of these objects which is possible would be useful. Again, 
the spatial distribution of variable star candidates across the LT field would be 
interesting to investigate, for its own sake, and because of its relevance to the 
spatial distribution of microlensing events, which is predicted to be detectably 
different, given enough detected events. 
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Figure 1.6: A figurative representation of the Angstrom collaboration pipeline. 
This plot is described more fully in Section 11.5.21 



Chapter 2 



Liverpool Telescope Pilot Season 



2.1 Introduction 

This Chapter describes the first "pilot" 
Telescope (henceforth "LT") data and 
them. The robotic operation of the LT 



season of Angstrom using the Liverpool 
the analyses that were performed using 
is also described. 



2.2 Robotic Telescopes 

The LT and FTN telescopes are both "robotic". This means that no human 
operator is required on the mountain at the telescope. Robotic telescope data 
comprised 74.9% of Angstrom unstacked image frames up to the end of Season 
3 (2006/7). The LT and FTN are built to identical designs. The LT is sited at 
the international Observatorio del Roque de los Muchachos on the summit of the 
island of La Palma, in the Canary Islands. The Faulkes Telescope North is located 
on the mountain of Haleakala, on the Hawaiian island of Maui. The altitude is 
10,000 feet above sea level meaning that the telescope sits above the cloud layer 
for the majority of the time. The mean seeing at this site is around one arcsecond. 
Both telescopes are enclosed within a protective dome which is designed to open 
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fully rather than having a narrow slit through which to observe. This innovative 
dome design is referred to as "clamshell". When the dome is fully open, it is 
possible to move the telescope rapidly from one side of the sky to the other down 
to an artificial horizon level of 20° above the horizontal with a slew rate of at least 
2 degrees per second. This ability to slew rapidly is very important in relatively 
new areas of astronomy where a rapid response is required. Some of the areas 
of research in which robotic telescopes are making an important contribution are 
long term monitoring programs, fast transients such as novae /sup ernovae and 
gamma ray bursts. 

Robotic telescopes do not require an operator to c onstantly be at the telescope 



deciding which target to observe and for how long fjBode 



19951 ) . The tel escopes 



are d esigned to operate autonomously, as if there were an operator present (jSteele 



2001) 



by 



Bode 



his le ads to considerable cost and convenience benefits, as pointed out 
( 119971 ). However, it is also possible for an observer to take direct control 
of one of the telescopes remotely over the internet in order to direct it to observe 
particular targets. This capability is used by users in the United Kingdom such 
as schools, through the "National Schools Observatory" to learn about telescopes 
and observational astronomy. A significant fraction of the observing time of the 



LT is allocated for these educational purposes. 



2.2.1 Instruments 

On the LT there are currently four operational instruments. These are: "RAT- 
Cam" , an optical CCD camera, "SupIRCam" , an infrared array camera, "RINGO" , 
an optical polarimeter, and "the Meaburn Spectrometer" . 

The Angstrom project uses RATcam, which has a 2048x2048 CCD which gives a 
pixel size of 13.5 microns, which corresponds to 0.14". Most of our observations 
are made in Sloan i' band, which has an approximate wavelength range of 6930 
- 8670 A, but observations have also been taken in Sloan r' which spans 5560 - 
6890 A. 
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2.2.2 Pre-processing at the LT 

As well as the observations being scheduled and carried out autonomously, the 
data are preprocessed at the telescope and then plac ed on a web site, where it 



20001). In order to 



may be accessed remotely by Principal Investigators flSteeld . 
do this, regular flat field observations are scheduled automatically. The opera- 
tions which are performed automatically at the telescope are: bias subtraction, 
overscan trimming, and flat fielding. 

From operational experience, it is known that RATcam does exhibit a small 
gradient down each column of the image, and a first order fit to this is required 
to remove it. Overscan areas are trimmed from the image leaving a 2048x2048 
pixel image. Cosmic ray rejection is not performed, nor bad pixel masks applied 
at the telescope. Users of the telescope who are attempting to produce accurate 
photometry will need to know exactly what masking has been applied, so it 
is considered better that these functions are performed by individual projects. 
However, the Angstrom project has generated bad pixel masks as a necessary part 
of our pipeline and have made them available for general use. At the moment dark 
subtraction is not part of the LT preprocessing pipeline, (although the facility 
does exist in the reduction pipeline to add it at a future date), as it has been 
determined experimentally that at the temperatures the telescope experiences 
in normal operation dark current is not significant. The optical beams on the 
LT and FTN are vignetted to a small degree by the filter wheel. Each filter 
has a slightly different effect. This vignetting is usually well removed by flat 
fielding. De-fringing is currently not performed at the telescope, although master 
fringe frames, which have been produced by stacking many deep integrations, are 
provided to enable users to perform their own de-fringing. 
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2.2.3 Scheduling 

A sophisticated automated scheduling system takes all the information about 
the projects which have been accepted for observation, for e.g. order of priority 



and o bserving requirements and decides which target should be observed fjSteele 



19981 ). given the current observing conditions and astronomical activity (e.g. the 
occurrence of gamma ray bursts). This leads to a more efficient use of telescope 
time, and the possibility of quickly turning the telescope to observe targets of 
opportunity as required. 



2.3 Introduction to Preprocessing 

At the very start of the author's work on this project some time was spent becom- 
ing familiar with the processes which were necessary to perform preprocessing on 
the raw unstacked fits images and to combine them into stacked images. This 
also allowed familiarity to be developed with some of the relevant IRAF (iTodyl . 
1986 ) 3 routines and to learn some of the basics of C-Shell programming. This 



was to prove very useful for the rest of the PhD project. 



2.4 Preprocessing 

What might be called "initial" preprocessing had already been performed before 
the author began working on the FITS images. This meant that processes such as 
de-biasing, dark current subtraction and flat fielding had already been performed. 

The first few months of study were spent learning to use basic IRAF routines such 
as "IMREPLACE" , "DAOFIND" , "XYXYMATCH" , "GEOMAP" and "IMCOM- 
BINE", which were necessary to perform the preprocessing on the LT images of 

1 IRAF is distributed by the National Optical Astronomy Observatories, which are operated 
by the Association of Universities for Research in Astronomy, Inc., under cooperative agreement 
with the National Science Foundation. 
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M31. The processes performed by the eventual script were: bad pixel masking, 
bright object finding, calculation of image relative translations and rotations (and 
possibly scalings) and image stacking. Some quite extensive experimentation was 
required to find suitable parameter ranges for these routines which would work 
for all images and produce useful output which could then be used by the next 
step in the chain. The actual operations performed on the images were as follows: 

IMREPLACE was used to replace the values of groups of individual pixels with 
the number -1. This was chosen as a minimum value of was set later, in 
IMCOMBINE, resulting in values less than being ignored. The groups of pixels 
that it was necessary to replace were a strip of 5 pixels along the bottom of each 
image which were bad due to an error in the previous processing, and a vertical 
strip 1 pixel wide, starting at pixel coordinate (753,1) and ending at (753,2048), 
which appeared to be a bad column on the CCD. 

Next, DAOFIND was used to fit Gaussian peaks to areas in the images that 
were more than 3.5a brighter than their surroundings. This was done to find the 
positions of bright stars in the image that could be matched to the equivalent stars 
in other images. The value of a was one of the variable parameters which had to 
be found by experimentation. Since some of the images in the data set had longer 
exposure times than others, the average brightness of the images, and hence the 
value of a which produced the best results, was not constant for the whole data 
set. This had to be borne in mind when processing the data automatically. The 
first images taken of M31 were generally longer and hence fewer each night, but 
due to telescope tracking difficulties, a large proportion of these were deemed 
unusable due to the telescope moving during the exposure. To counteract this, 
by reducing the number of unusable images, the exposure time of later images was 
reduced. Therefore, the "early-long" and "late-brief images had to be processed 
separately using different values of a. The values of a were chosen so as to produce 
between 15 and 40 hits. By visual examination of the output from DAOFIND 
in the form of lists of the coordinates of the "stars" found, it was possible to 
see that DAOFIND was finding relatively large numbers of hits in the areas of 
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the images where the core of the galaxy is. These are assumed to be in most 
cases merely random fluctuations in the background brightness, and not stars. In 
an attempt to ignore these, a parameter in the function XYXYMATCH which 
sets the minimum separation allowable for "stars" was set to 150 pixels. This 
eliminated almost all the spurious hits as they tended to be clustered together 
quite tightly, and hence much closer together than the real stars, which were 
separated by significantly more than 150 pixels in most cases. In any case it is 
not necessary to find all the stars in the image, just enough to be able to match 
them to other images, i.e., a minimum of one to find translations, and two to find 
any rotation between images. A few more stars than this are useful, however, to 
provide error checking and to increase the accuracy of the matching. 

XYXYMATCH was then used to take the lists of stars produced by DAOFIND 
and to find matches between the stars in each of the images and thereby calculate 
the transformation between each star in an image relative to a reference image 
in each coordinate direction. 

The output from XYXYMATCH could then be input into the GEOMAP function, 
which uses the coordinate transformations in a polynomial fit to calculate a mean 
translation in each coordinate direction and also a rotation of each image relative 
to the reference image. 

Originally, the rotation of the images was fixed first as a separate step using 
the IRAF function "rotate", but in the final pipeline, XYXYMATCH is used to 
calculate the required rotations (and translations) and then "GEOMAP" is used 
to perform all the transformations simultaneously. 

The translations produced as output of GEOMAP were originally used as input to 
the IMCOMBINE function. This takes all the images in an epoch and lays them 
on top of each other, shifting them horizontally and vertically by the translations 
calculated previously. The values of all stacked pixels were averaged, and the 
brightness of the image was scaled by the median pixel value in a square area of the 
image centred on the middle, the bottom left corner being at pixel (724,724) and 
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the top right being at (1324,1324). The function "crreject" within IMCOMBINE 
was used to reject pixels that are more than 3 times the expected CCD readout 
noise (a) different than the area immediately around them. This is done in an 
attempt to remove most cosmic rays. 

In order to facilitate the automatic processing of large numbers of images, IRAF 
commands from within UNIX were inserted within a Unix script to perform the 
running of IRAF functions automatically. This enabled the whole directory of 
images to be processed in one long computing run, rather than doing each epoch 
semi- manually. 

When this automated script had been perfected, it was passed over to Dr An- 
drew Newsam (Liverpool JMU), who incorporated the principles and processes 
contained within it into his own more extensive script, called "stackEpochLT.csh" 
which tied together pre-processing and photometry of the pilot season data. 

2.5 Investigating the Common Overlap Region 

During the LT pilot season there were major problems with the pointing accuracy 
of the LT relative to the intended (specified) field. Epoch images had considerable 
translational and rotational errors relative to one another (which could be of the 
order of hundreds of arcseconds and tens of degrees) which had the effect of 
severely reducing the common overlap between them which could be used to 
produce time-series of difference imaging data for the purposes of finding time 
varying sources. The reason that this investigation became necessary was that 
the original ISIS image subtraction package was unable to handle image masks. 
Therefore the region on which ISIS was able to produce lightcurves was the region 
in which every image in the stack overlapped, which was reduced every time a 
further misaligned image was added to the stack. At that time, the question 
of what was the best available compromise between overlap area and number of 
time points (equivalent to the number of images in the stack) in each lightcurve 
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was one that needed to be clarified. This compromise would be the one in which 
the total amount of usable data was optimised. 

In order to investigate this issue, again using IRAF software, an image was pro- 
duced for one example reference image which showed the pattern of overlaps, and 
the relative "image density" of various regions of the combined image (i.e. how 
many images were coincident at that point of the super- image). Visual inspection 
of this image implied that the number of overlapping images fell off very rapidly 
as one moved away from the region in which all the images overlapped, implying 
that only a small gain in coverage area might be made if one relaxed the con- 
straint on the number of overlapping images. This was investigated further more 
analytically, as described below. 

In order to discover whether it was possible to make a gain in the area covered by 
all images considered, by removing some (hopefully small) number of images that 
were outliers in position and/or rotation, and thereby the cause of a dispropor- 
tionate reduction in overlap analytical study was made of the distribution 
of images with respect to one another in displacement and angle. Several scripts 
were developed with the aim of deciding firstly whether there were any images 
which were having a particularly deleterious effect on the overlap area, and if 
so, which ones and how many should be discarded. The "Quality Factor" which 
needed to be optimised relates to the total amount of microlensing information 
available in the images, and is equal to the number of images used multiplied 
by the overlap area remaining, i.e. Quality Factor QF = N ■ A. The idea was 
to discover which of the images were particularly bad outliers, and then remove 
these one by one and calculate the QF at each stage to discover whether it had 
a maximum and if so, at which point. It was important to sort the images in 
order of "centrality" since the remaining area after n images depends sensitively 
on the order in which these are added to the stack. To maximise the remaining 
area and hence the quality factor, the best images must be added first and the 
worst images last. 

The most accurate way (i.e. requiring no approximations) to do this would be 
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the "brute force" method- trying every combination of reference image (the im- 
age upon which the overlap area of all other images is calculated) and other 
images. A script was written to attempt to accomplish this, starting with the 
object (star coordinate) files for each image previously found by starfind, and 
calculating from first principles the rotation, the x and y shift, and hence the 
overlap area. However it was found that the computing time using this method 
was too great, so it was clear that an alternate method would be better. There- 
fore, a method was developed whereby for each reference image a weighting factor 
would be calculated for every other image based on that image's displacement 
in the two coordinate directions and angular rotation relative to the reference 
image. The total weighting Wt was calculated by calculating geometrically the 
area of "overhang" i.e. the non-overlap of the image considered with the reference 
image due to the displacements in the three directions independently, and then 
multiplying these together as in Equation 12.11 



W T = W x W y W e (2.1) 

where the individual weights in the three directions W x ,W y and We were defined 
by the Equations 12.21 12.31 and 12.41 respectively. 



At 

W. = \—J (2.2) 



Ay 

^ = 1-^1 (2-3) 

where Xj m and Uj m are the dimension of one side of an image in whatever units 
are used (here x« m = yi m = 2048 pixels) 



W e = 2sin(|A0|) cos(A0)/(sin(|A0|) + cos(A0) + 1.0) 2 (2.4) 
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where A0 is the rotation angle in radians minus n/2. In order to make the 
three terms have the same maximum possible magnitude, the x and y terms 
were scaled by a factor = 2/(2 + a/(2)) 2 so that when = A^, (where 
A^ is the maximum possible overhang area due to rotation which occurs at 45 
degrees if the sides of the original square image are aligned with the coordinate 
axes), the weight W x = 1. In other words the x and y weights could correctly 
have a larger effect since the overhang areas can theoretically be equal to the 
total area of the image, whereas the weight cannot. Once the single iteration 
of the original overlap area script had been run, it was realised that once the 
translations/rotations between one reference image and all the other images were 
known, the translations/rotations between any other pair of images could be 
calculated by geometry, i.e. to go from image B to image C with image A as 
a reference, the reverse of the AB transformation is used followed by the AC 
transformation. This method was then used to calculate the transformation for 
all other combinations of images in order that the weightings for each image 
pair could be calculated. Once these weightings had been calculated, the median 
weighting for each reference image was calculated. This gave information about 
the geometrical relationship of the reference image to all the other images. A 
high median weighting in this case implied that the particular reference image 
is widely separated from the centroid of all the other images. In addition the 
distribution of these medians was plotted to give some idea whether there were a 
few outliers which were far from the bulk of the images or whether all the images 
were close together in median and hence no great improvement could be expected 
by removing the worst images. By calculating the medians the images could be 
ordered in (estimated) overlap area. Then the images were stacked in the order of 
"goodness", and the remaining overlap which included ALL the stacked images 
was calculated, along with the quality factor. Then the images with the worst 
median weightings (i.e. the images that were, on average most separated from 
the bulk of the rest of the images) were removed from the stack one by one in 
order of "badness" , and the overlap area and quality factor re-calculated. When 
this was done for different reference images, the same two images were found to 
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have the worst positions. These were the observation from 22.10.2004 followed 
by the one from 15.11.2004. For the "best" reference image, which was taken 
on 21.08.2004, removal of these two images resulted in an increase in the overlap 
area of all the remaining images from 0.409 to 0.448 of the original image area, a 
fractional increase of 9.5%. For this reference image, the quality factor increased 
by 7.5%. Removing more than these two worst images from the original stack 
of 56 (Lband) epochs resulted in a consistently lower quality factor, as expected. 
The plot of overlapping images for the stack of images is shown in Figure 12.11 
showing the kind of overlap region which existed at this time. Due to it having 
been shown above that the overlap region of all images was rather small (less 
than 50%) and therefore much information would otherwise be lost in the edges 
of the image, the ISIS code was modified by us. It was decided that a more 
efficient method of utilising more of the information in the images would be to 
make a "super image" which was a rectangular frame of dimensions which were 
large enough to include the furthest out pixel on the rotated images so that all 
the image pixels from the original images were utilised and no information was 
discarded at this early stage. Later on in the process, in the early stages of the 
candidate selection pipeline, cuts were used to discard lightcurves (equivalent to 
image stacks of one pixel) containing less than a certain number of data points 
(images) . 

2.5.1 LT/FTN difference image assessment +masking 
after the second season 

In the first two seasons of LT and FTN images there were many problems, partic- 
ularly with the FTN, which caused several different kinds of degradation of the 
resulting difference images. In some cases the flaws on the difference images were 
localised and could therefore be masked out, and in other cases they were more 
distributed across the whole image. In these decision had to be made on 

whether the image was usable at all. After the end of the second season of data 
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Figure 2.1: The overlap image (all 49 images collected up to that point included) 
for the LT i-band reference image LT_i_20040816_001, showing the variation in 
density of images over the overlap region. The colours encode the numbers of 
images overlapping in each area, which generally increase towards the centre, in 
which all 49 images overlap in the black region. In the magenta regions (for e.g. 
the top right corner, only the reference image exists.) 

taking, i.e. in mid-2006, all the LT and FTN images to date were examined by 
eye, to assess the quality of the images and to assess their usability or to discover 
any flaws in the difference images. These images numbered 174 LT and 167 FTN, 
subdivided into LT i'-band (130), LT r-band (44), FTN i'-band (85) and FTN 
R-band (82). The problems/flaws with the images could basically be grouped 
into ten categories, which are listed below. 

■ Large Bipolar galactic centre 

■ PSF mismatch 

■ Dark and/or light blobs 
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• Edge effects 

■ Corner shading/lightening 

■ Fringing 

■ Stripes/lines 

■ Smearing 

■ Oil-Speckling 

• General Noisiness 

• Large PSF 

For each difference image, these flaws were noted, where present, and in addition 
the image was given a general classification as "C" , "P" , or "U" , where these are 
defined as: C = "Completely or almost completely usable" P = "Partially usable- 
some significant fraction of the image can be used" U = "completely Unusable 
at that moment, although some may have been correctable". In total over all 
images, 87 received a "P" classification and only 6 a "U". However, the "P" 
classifications were clearly clustered in the FTN R-band category- 46 out of the 
total of 82 images had this classification. Many of these were due to some extent 
of "Speckling/stippling" , which was mostly concentrated in the brighter regions 
of the image, i.e. around the galactic centre. The cause of this effect was thought 
to be an oil leak which occurred on the FTN which unfortunately sprayed the 
surface of the CCD chip, which could not be immediately replaced. 

Examples of difference images containing some of the categories of flaw described 
above are shown in the images below, Figures 12.21 to 12. 6[ which are all selected 
from LT i'-band data. 

In the top image of Figure I2.2[ a minor scattered light problem has lead to a poor 
fit to the overall galaxy model image, whereas in the bottom image of Figure 
12. 2\ the cause or causes of the noisy galactic centre to this image are not clear 
but may possibly be due to a poor flat field. The top image of Figure 12.31 shows 
darker stripes which, it is thought, were caused by scattered moonlight. This 
problem was fixed on the LT by the installation of new baffles in 2007. After 



2.5. Investigating the Common Overlap Region 



47 



this modification a very significant improvement in the data quality from the LT 
was seen. The middle image of Figure 12.31 shows dark horizontal lines which are 
probably due a slight misalignment occurring in the difference imaging process. 
The bottom image of Figure [23] shows both a "crescent moon"-shaped smearing 
of the background which is likely to be caused by scattered moonlight and also 
two clear lighter lines, the cause of which is unknown. The two images in Figure 
12.41 may both involve scattered light clS cL CcLUSC The top image has one side 
lighter than the rest of the image, possibly caused by scattered moonlight. The 
left hand side of the bottom image in Figure 12.41 is also much lighter than the 
rest of the image. This may be caused either by a serious scattered light problem 
or perhaps a total misalignment in the difference imaging pipeline. All three 
images in Figure 12.51 contain "doughnut-shaped" dark and light artefacts, which 
are caused by out of focus dust particles. Figure 12.61 shows a good example of a 
failed def ringing process. 
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Figure 2.2: Examples of faults in the difference imaging (1). Top) This is a more 
serious example of a particularly common phenomenon in the difference imaging. 
A minor scattered light problem has lead to a poor fit to the overall galaxy model 
image. Bottom) The cause or causes of the noisy galactic centre to this image 
are not clear but may possibly be due to a poor flat field. 
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Figure 2.3: Examples of faults in the difference imaging (2). Top) The darker 
stripes in this image were caused by scattered moonlight. This problem was fixed 
on the LT by the installation of new baffles in 2007. After this modification a 
very significant improvement in the data quality from the LT was seen. Middle) 
The dark horizontal lines are probably due a slight misalignment of the difference 
imaging. Bottom) The cause of this problem is currently unknown. 
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Figure 2.4: Examples of faults in the difference imaging (3). Top) An image 
with one side lighter than the rest of the image, possibly caused by scattered 
moonlight. Bottom) This image also has one side much lighter than the other. 
This may be caused either by a serious scattered light problem or perhaps a total 
misalignment in the difference imaging pipeline. 
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In the cases where the flaws with the images were localised and could there- 
fore be masked out, for example, "Dark and/or light blobs" "Edge effects" or 
"Corner shading" , this w as done by creating masks using the facility in the DS9 



(jJoye and Mandel 



20031 ) program for creating geometrical "region" (.reg) files. 
These contain data about the shapes and sizes of whichever geometrical shapes 
were used to compile the desired mask, and are in a very similar format to that 
read in by the IRAF "mskregions" task. A brief script was used to convert the 
.reg files into IRAF readable ones, and then IRAF was used to create image 
masks which consisted of mainly zeros, with ones where the masking was to oc- 
cur. The position of each variable object from the variable object database was 
then checked against the masked areas in each image in the lightcurve in turn, 
and data points which fell inside a masked region were removed. Each time a new 
difference imaging photometry pipeline run (with different parameters/algorithm 
or a new reference image) was performed, this masking had to be repeated as the 
positions of the variable objects found by the DIA would be different. 

During the Third Season, it was discovered fortuitously that one of the causes 
of some of the poor difference images was that the reference image was slightly 
mis-aligned from the rest of the data. This had previously had the effect of 
producing a noticeable "bipolar" effect on the difference images, which caused 
bright sources, particularly the galactic centre, but also variable objects to be 
darker than average on one side of the source position and lighter on the other. 
This caused the flux contained in the object to be spatially smeared out, and 
hence lowered the signal to noise of our lightcurves. When fixed, the lightcurves 
appeared noticeably better by eye. The dark edge pixels had also been fixed by 
this time in the improved pipeline. Therefore, the decision was made to not go 
through the tedious process of re-creating the masks from scratch (which would 
then have been for many more images than before) as it was not felt that the 
potential gains would justify the extra work, given the now-improved quality of 
the difference images. 




Figure 2.5: Examples of faults in the difference imaging (4). Top), Middle) and 
Bottom) All three images are likely to be examples of a poor flat field. The 
doughnut shapes are caused by the out of focus images of specks of dust 
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Figure 2.6: Examples of faults in the difference imaging (5). This image clearly 
shows a good example of fringing which has not been successfully removed. 



Chapter 3 

Angstrom Pipelines and Data 



3.1 Introduction 



In this Chapter, the operation of the Angstrom Project Alert System is briefly de- 
scribed. Then, the operation of the data analysis pipeline is summarised. Finally, 
the data set upon which the forthcoming analysis is based is described. 



3.2 The Angstrom Project Alert System (APAS) 

The normal mode of operation of the Angstrom Data Analysis Pipeline (ADAP) 
is to process the data collected over one observing season offline at the end of that 
season. However, the ADAP is also used to process 1024x1024 re-binned copies of 
the 2048x204 8 robotic telescope im ages which are small enough to be processed 



in real time ( iDarnley et al.l . l2007al ). Many of the stages of this processing are 
the same as used by the main offline Pipeline, and so a brief summary is given 
below. An initial quality control check is made on the images based on the size 
and shape of the PSF using bright foreground stars. Images that fail this check 
are not used in the following Difference Image Analysis as to do so would result 
in lowering the quality of the lightcurves produced. On the images which pass 
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this test, the APAS performs defect masking, cosmic ray rejection and image 
alignment. Next it defringes each image, using an iterative method, described 
below in 13.3.11 AngstromlSIS is used to PSF match and then stack groups of 
images taken sequentially in one run of data-taking at the same epoch. One image 
stack, usually the one with the smallest PSF, is selected as the reference image. 
This is convolved with, and then subtracted from, the other images to create 
a sequence of difference images. From each stack of images, a likelihood map 
of variable sources is constructe d by using a n indicator of statistical significance 



which is similar to that of Cash ( Cash 



19791 ) shown in Equation 13 . 3 1 below . From 



these maps, discrete sources are identified and positionally matched with previous 
epochs. A master list of object positions is maintained and updated with new 
objects if they have positions which have not been seen before. Photometry 
is performed on the difference images using PSF fitting. The interval between 
observation and adding new photometric points to the database is about 2 hours. 
The legacy POINT- AGAPE data are also analysed by the same pipeline, off-line, 
in those regions that overlap with the LT/FTN field. 



3.2.1 Candidate Selection Criteria of the APAS 

There follows a brief description of the criteria used by the APAS to decide 
whether a particular object should be (currently) classed as "flat", "variable", 
"followed", "alert", or "old". 

If the lightcurve of an object has no detected peaks, then it is classified as "flat" . 
A "peak" is defined, basically, as a cluster of five points which are at least 5cr 
above the baseline for that lightcurve. The baseline estimate is determined by 
the median lightcurve flux. Up to three non-significant points are allowed within 
a peak, with the provisos that these points occur during a period of bad seeing 
and they are followed by at least one more significant point within the same 
observing season. This classification can of course change with the addition of 
new data points. If the lightcurve contains no peaks within the current season, but 
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does have at least one peak in one or more previous seasons, then it is classed as 
"variable" and is also subject to re-analysis when there are new data. If an object 
lightcurve does have a peak in the current season, and also has a peak or peaks 
in a previous season or seasons, then it must still be analysed by the alert system 
because of the high probability in the central regions of M31 that blending exists 
in the lightcurves of transient and microlensing events due to the crowding of 
variables, leading to variations in the baselines of these e vents in a similar way, for 



Paulin-Henriksson et al. 



e.g., t o microlensing candidate PA-99-N1, as described in 
(120031 ). Thus objects with both a variation in the current season and a previous 
season are not necessarily automatically classed as "variable" . If the most recent 
variation has an amplitude above a given threshold in the reference image and is 
at least 50% greater than the highest previous variation then it is still classified 
as a transient, otherwise it is classified as a "variable". All surviving objects are 
considered to be transients and are therefore flagged as "followed" . The FTN and 
POINT-AGAPE lightcurves (where these exist) are then scaled to the reference 
image of the LT data, using the previously calculated flux scalings described in 
Section 14.2.101 Next, the lightcurves are fitted with a reduced Paczyhski curve 
(see Equation II . 15)) . It is required that the time of the peak flux of the fitted 
curve does not occur before the first observation of the current season and also not 
after the anticipated start of the next season. Also, the duration of the event, as 
measured by the full width half maximum time ^fwhm j is required to be within 
the range 1 < £fwhm < 365 days. The last criterion is that the amplitude of 
the event is positive as all physical phenomena that might be of interest (novae, 
microlensing) are positive deviations from a baseline. If an object passes all the 
above criteria it is classed as an "alert" . Any object failing any of the criteria is 
still classed as "followed" , unless it was previously classed as "alert" , in which case 
it is re-classified as "old". The results of the APAS analysis are displayed on a 
web interface for a human observer to inspect. The final decision as to whether to 
alert a particular object is currently left to this observer. The APAS also displays 
lightcurves for all variable objects within 3" of the object of interest, which assists 
in assessing whether an object might be a real alert but contaminated with light 
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from a nearby variable object. 

The first alert ever given by the system, from the 2005/6 observing season, shows 
special promise as a possible microlensing candidate. This appears to be a high 
signal to noise, very short timescale (£fwhm ~ 1 day) microlensing event. This 
event has been the subject of a study by the author, the details of which are 
reported in Chapter (jSJ). The security of the identification of this event as lensing 
has been increased by the addition off-line of some reduced data from the Maid- 
anak telescope, which confirm the gradual smooth rise of the lightcurve, greatly 
reducing the chances that this could be an extremely short duration classical 
nova. 

3.3 The Angstrom Data Analysis Pipeline 
3.3.1 Data Collection 

During the first "pilot" season, only the LT, Doyak and Hiltner telescopes were 
available. 86 frames of data were obtained from the BOAO 1.8m telescope, 418 
frames from the LT in i' band plus 550 in r band along with 453 frames from 
the Hiltner telescope through the MDM observatory in Arizona. However, in 
the second season, the FTN also became available through RoboNet open time, 
and hence the rate of data collection increased markedly as compared with the 
pilot season. The FTN contributed 2592 frames out of the total of 4447 for the 
season, 1303 in i' band and 1289 in R band. The next largest contribution was 
from the LT in i' band, with 984 frames, and then the Hiltner (481) and Doyak 
(390). In the Third Season, the FTN R band observations were discontinued, 
but a new telescope, the Maidanak in Uzbekistan was added to our collabora- 
tion, contributing 352 frames of data. The LT (i') obtained 973 frames, the 
FTN (i') 600, and the Doyak took 286. Also very important to the project were 
591 stacked epochs from the POINT-AGAPE project, which span the four sea- 
sons 1998/9, 1999/2000, 2000/2001 and 2001/2. These data were from the two 
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POINT- AGAPE (henceforth, "PA") fields 2 and 3 which overlap substantially 
with the top and bottom part of the Angstrom field. The above information is 
summarised in Figure 13.11 and in Table 13.11 It can also be seen from Figure 13.11 
that the majority of the data collected were from LT and FTN together, with 
relatively smaller contributions from the other three non-robotic telescopes. It 
should be noted that no attempt has been made to scale the plot according to 
exposure length, so this graph does not accurately reflect the amount of real in- 
formation contributed by each telescope, just the number of epochs. At the end 
of the Third Angstrom Season in February 2007, 8165 "raw" (unstacked) frames 
of data had been collected by the collaboration. 
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Figure 3.1: Plot showing a summary of the accumulation of Angstrom data during 
the first three seasons. 

The structure of the Angstrom Difference Imaging Pipeline can be summarised 
in eight sections as follows: 

■Quality control 

■Defringing 

■Alignment 

DIA 

■Source Detection 











Telescopes 










Seasons totals 




Maidanak 


Doyak 


LT 


FTN 


Hiltner 


PA 2 


PA 3 










i' 
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Pre- Angstrom 
















/306 


/285 


/591 


Season 1 (pilot) 
Season 2 
Season 3 


352/28 


86/20 
390/32 
286/12 


418/54 
984/82 
973/121 


550/44 


1303/83 
600/- 


1289/70 


453/55 
481/13 






1507/173 
4447/280 
2211/ 161 


Telescope Totals 


352/28 


762/64 


2375/257 


550/44 


1903/ 83 


1289/70 


934/68 


591 




8165/614 



Table 3.1: Table summarising the data collected by Angstrom over the first three seasons. In each line and column, the data 
displayed as "unstacked exposures/stacked epochs". 
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■Merge lists of newly detected and previously detected objects 

■Perform photometry of new objects in all observations 

■Perform photometry of all objects in the current image 

These eight operations are performed on each epoch of data in "real time" i.e. 
the epoch is analysed before the next epoch arrives. Currently, the Pipeline is 
operating on a reduced data set consisting of "binned" images which contain 
1024x1024 pixels, as opposed to the original 2048x2048 pixels of the original 
LT/FTN CCDs. It was decided to do this so that the Pipeline would be capable 
of analysing one night's data in less than 24 hours. 

The most complicated of the sections above and most costly in terms of computing 
time are the first three, "Quality Control, Defringing and Alignment" . 

Stacking 

The first thing that is done is to apply the mask of known bad pixels for each 
camera. The routine then attempts to "fix" the values of the rejected pixels by 
interpolating across them using all good data in surrounding pixels. A mask is 
then made consisting of the bad pixels. All subsequent operations in the Pipeline 
are also performed on these mask images as well as on the "good data" images, 
to allow the relative contribution of bad pixels to every other pixel after all other 
operations have been performed to be assessed. Next, an initial guess is made 
at the PSF of the epoch, using the IRAF task "starfind" . This routine searches 
for "extended point-like" objects that have a peak flux that is above a given 
threshold value. The background is estimated locally by starfind, which is an 
important requirement for our observations. It then fits an elliptical Gaussian 
profile to these pixels and hence estimates their FWHM and ellipticity. The 
average of these two properties from the majority of the starlike objects found in 
each image (some extreme outliers are rejected) is taken as the initial estimate of 
the PSF size and shape. On the basis of this, individual observations are classified 
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as "Good" "Bad" or "Ugly" in terms of the size and ellipticity of the PSF. The 
criteria used for this categorisation for the reduction of the data investigated in 
this thesis are: 

If the "Roundness" or ellipticity, which is defined as in Equation 13.11 where a x 
and o y are the widths of the fitted Gaussian profile along its major (x) and 
minor (y) axes respectively, is larger than 0.6 or the FWHM of the PSF is more 
than 11.0 pixels then the observation is classified as "Ugly". Otherwise, if the 
"Roundness" is between 0.4 and 0.6 or FWHM is between 8.75 and 11.0 pixels 
then the observation is classified as "Bad". In all other cases the observation is 
classified as "Good". 

"Good" and "Bad" classifications are treated in the same way, but "Ugly" obser- 
vations are rejected entirely and not used further. 



The next task that is performed is cosmic ray removal which utilised another 
IRAF task; "craverage". This task requires the estimate of the PSF calculated 
above as a parameter in order to decide what is a cosmic ray (which are on the 
whole very compact, with little energy spreading to surrounding pixels) and what 
might be a "real" feature of the image. 

Finally, a thin border, currently 4 pixels in width for binned data, is masked out 
around the edge of the images to remove edge effects (over-bright or over-dark 
pixels) which occur in the outer few lines. 

Alignment-part I 

The first thing that is done is to align the centroids of point-like sources in the 
images. 

Point-like sources in the images are found using the IRAF task "starfind" , using 
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the previously calculated estimate of the PSF as a guide to the size of object 
the routine should initially expect to find. If the number of sources found is 
insufficient (currently defined as less than 8) then the image is not used in any 
following calculations. If sufficient objects are found, then the list of found objects 
is compared to the list of stars from the astrometric reference image (the DIA 
reference image could be different but is aligned to the astrometric reference 
image). The above process is repeated for all cameras being used. Pairs of objects 
in the two lists are matched up using the IRAF task "XYXYmatch". Again, if 
there are insufficient matched pairs then the image is dropped. If sufficient pairs 
are found then the transformation between the current image and the reference 
image is calculated using the IRAF "geomap" task, initially allowing three degrees 
of transformation; a translation, a rotation and a scaling factor. As a sanity check, 
the calculated scaling factor should be very close to 1. If the scaling factor is not 
1, the limits being 0.95 < magnification < 1.05, then the image is dropped. All 
geometrical transformations on images within each epoch should be similar (this 
is checked). If this is true then the object matching stage is repeated, this time 
giving the average of all calculated transformations as a "first guess" . This may 
occasionally find a few more pairs of objects than before and consequently improve 
the calculated transformations. On the second iteration, the transformations are 
calculated allowing only a shift and a rotation. At the same time the inverse 
transformation is calculated. 



Defringing 

A subset of Angstrom i' band observations contain fringe patterns which are 
clearly visible by eye. For an example, see Figure 12.61 These are thin-film inter- 
ference patterns which originate in multiple reflection from the top and bottom 
surfaces of the CCD. The effect of these fringes may be increased in the difference 
images when the fringe patterns are misaligned, causing a significant modulation 
in the background flux level. 
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Fortunately, the fringe pattern does not change much over time. Therefore, a mas- 
ter fringe frame, which may be labelled F, can be built up by adding a number of 
images, and then scaled and subtracted from a target image to minimise fringing 
in that image. However, in the case of the signal of interest being of the same 
order of magnitude as the level of fringing and much lower in magnitude than the 
background flux level, as in our data, this can be computationally intensive. For- 
tunately, we are able to take advantage of the problems experienced in the pilot 
season with large pointing misalignments and rotator problems (see Section 12. 5p 
on the LT. By adding together all the aligned frames, the now-randomly-oriented 
fringe patterns largely cancel out, giving a first order fringe free template, M. 
This gives a good first estimate of the background galaxy light distribution in 
our images. The fringe level in the individual images can be computed and then 
removed by minimising the sum: 



over all pixels i, where I is the target image to be defringed, and m, / and s are 
the scalings and offset respectively to be computed. By using the stack of first 
order defringed images as a second order fringe-free model, and repeating the 
process above, the level of fringing can be further reduced. No obvious evidence 
of residual fringing is found in the images after this second level of defringing, so 
the iterative process does not need to be continued any further. 

Due to the similarity between the LT and FTN, which includes CCD cameras and 
filters, it is possible to use the LT fringe-free frame as the first order FTN image. 
Unfortunately, it has not yet been possible to produce reliable fringe frames for 
the FTN, so to date the FTN has perforce been processed without defringing. 

For the Maidanak telescope, a first order fringe-free image is produced by taking 
a series of dithered observations of the Maidanak M31 field. These observations 
are combined in the same way as for the LT data, and the above algorithm used 
to remove the fringes. 



N 




(3.2) 
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Alignment-part II 

Using the previously calculated geometric transformations, the re-alignment of 
the current image into the frame of the reference image is performed. The same 
re-alignment is also performed on the bad pixel masks. After the re-alignment 
has been completed, the "starfind" routine is once again used to make a new 
determination of the PSF. This will usually be the same as when it was previously 
calculated. 



Stacking 

The images in each epoch are stacked. The bad pixels, as given by the masks, 
are not included. In order to calculate the new stacked pixel values, if there are 
less than five images in the stack then the mean value is taken but if there are 
5 or more images then the median value is taken. A new mask is then created 
giving information about how many images contribute to each pixel. 



Difference Imaging 



Angstrom Project DIA 

The difference i maging; (see Section 



sion" of the ISIS (lAlard and Lupton 



.4.31) is performed using the 'Angstrom ver- 



19981 ) image subtraction program, "Angstro- 



mlSIS" . The kind of reference image used has changed along with the development 
of the DIA pipeline. For the 2007 photometry, which was used for the analysis of 
variable stars contained in Chapter El a single image was used. This was chosen 
as one of the best DIA images, with a PSF which was as small and round as pos- 
sible. This does mean, however, that when the reference image is subtracted from 
itself the result should be very close to zero, and this means it is not possible to 
correctly estimate the error on the point. For this photometry, therefore, the flux 
point corresponding to the reference image was not used as its error was untrust- 
worthy (see Section 14.2. 3p . For the 2008 photometry, used for the microlensing 
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analysis (also in Chapter \5§ a reference image was constructed from a stack of 
good images, which removed the problem with error estimation and hence the 
need to remove the corresponding flux point from the fitted photometry. 

Parameters used for the DIA include: 
Number of stamps along the X and Y axes: 20 

Degree of polynomial to fit the background variations: 2 

Sigma of first Gaussian component = 0.7 

Sigma of second Gaussian component = 2.0 

Sigma of third Gaussian component = 6.0 

Degree associated with 1st Gaussian = 6 

Degree associated with 2nd Gaussian = 4 

Degree associated with 3rd Gaussian = 3 

Degree of the fit of the spatial variations of the Kernel = 3 

The best values for these parameters were found by experimentation, using our 
image data. 



Source Likelihood Map Generation 



For each pixel in each difference image, an elliptical Gaussian of known widths 
is fitted. Only the peak flux is allowed to vary. From the fitting, the best fitting 
peak flux and a statistic which is similar, but not ide ntical to y 2 are obtained. 
This statistic is a modified version of the Cash statistic f lCashl . 119791 1 . The original 
Cash statistic was designed for use with low photon counts, and has been modified 
for use with high photon counts (when approximating the Poisson noise as being 
drawn from a Gaussian distribution is reasonable). At a given point (or pixel) in 
a difference image, the modified Cash statistic (C moc i) is determined by fitting a 
simple model for the PSF, and minimising the sum 
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C^ = E( W ~ 6 t,~ mi(/) ) 2 (3.3) 

over a suitably large region around the nominal position. In Equation 13. 3[ di 
are the pixel counts at pixel i, hi is the local background, rrii(f) is the value for 
the PSF model, centred on pixel i and scaled by flux /, and Oi is the estimated 
photon noise for pixel i. Only the flux / is varied in the fit. 

Two more maps are produced: One containing the fitted fluxes and another with 
the "likelihoods", C mo d 2 . At this stage, known bright (foreground) stars and the 
centre of the galaxy are masked out. These masked areas are also scaled in size 
with the changes in the PSF. Source detection is performed on the likelihood 
map by considering all pixels above a given threshold (of value 2.0). For sources 
found in these pixels (which are defined as peaks in the value of the likelihood 
distribution), the peak likelihood value and centroid of the pixel values is found, 
which is assumed to be a good estimate of the position of the source. A list of new 
sources found in the current epoch is compiled. The "new" and "old" source lists 
are merged, using the following algorithm: For each object in the "new" object 
list, the program goes through the list of old objects and finds the closest match 
to it in position. The separation (measured in positional error bars) is calculated. 
Using two threshold constants "ni" and "tV, whose values are 2.0 and 6.0, if the 
position is greater than n^o away from any other object, then it is considered to 
be a new object, and is added to the list of objects. If the position is between n2<J 
and n\G then the object is probably an old object but this is not certain enough 
to use the position to improve the average position. If the position is closer than 
ni<j, then the object is considered to be a match with the old object, and the 
new position is combined with the previous average position of the old object and 
hence improves the knowledge of its position by reducing its error. 
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Photometry 

The pixel corresponding to the position of each object is found, and the sum 
of the flux in the PSF of the extended object surrounding that pixel is read. In 
order to calculate the error on this flux, the values of pixels taken from around the 
central pixel (the pixels used have to be independent, so they are more than the 
o of the PSF apart and in rings centred on the nominal pixel) are read and their 
scatter is used as an estimate of the likely error on the central pixel. Photometry 
is then performed on all objects in the merged (old + new) object list in the 
current difference image. 

Spatial Distribution of variable objects 

The spatial distribution of the final set of all 93240 variable objects from which 
the objects selected in this thesis and reported in Section 15.5.21 were selected, is 
shown in Figure I3T21 The M31 centre is defined as being at (0,0). The blank 
area around this and various other patches is due to these areas being masked 
out in the difference images. This was done for areas around bright resolved stars 
and the centre of the galaxy because image artefacts caused by the difference 
imaging process combined with Poisson noise due to the the large central surface 
brightness of the centre of the galaxy make detecting real variable objects in these 
regions impossible in the difference images. 
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Positions of Angstrom Variable Objects; as of November 2008: 93240 lightcurves 
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Figure 3.2: The spatial distribution of the 93240 objects in the Angstrom Variable 
Object Database at 27th November 2008. Units of X and Y are arcsecs on a square 
image: (the LT frame is 4.5' (= 270") on a side). The M31 galactic centre is at 
(0,0). Square gaps are due to masking of resolved bright stars. 



Chapter 4 

The Candidate Selection Pipeline 



4.1 Introduction 

In this Chapter the operation of the candidate selection pipeline is described and 
explained. Some aspects of the development process are detailed. The selection 
process used to select microlensing candidates after the main pipeline has been 
run is described. Finally, the investigation of the ratio of flux amplitudes between 
the POINT-AGAPE data and the other telescopes, using variable star candidate 
light curves is also described. 



4.1.1 Modelling periodic variable star lightcurves 



It was known from previous microlensing surveys in Andromeda that in all like- 
lihood large numbers of periodic variable stars such as Mira variables would be 
found in the Angstrom catalogue of variable objects. Many previous surveys 
have automatically rejected any lightcurve that shows any evidence of periodic 
variability as a suitable candi date to claim as a microle nsing candidate. One 



analysis of the PA survey data 



Calchi Novati et al 



20051 ) does include a similar 



Paczyhski + sinusoid fit in their analysis, and, as their sixth and final cut, reject 
a lightcurve as a probable variable star if both the Johnson- Cousins R band (A0) 
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flux difference between the modelled microlensing bump and another bump in the 
lightcurve which is due to the variable component is smaller than 1 magnitude 
and the time width of the sinusoidal part is compatible with that of the lensing 
bump within a factor of 2. Their sinus oid function is only a simple non-skew 



one, however. The other PA analysis, by 



Belokurov et al. 



(120051 ) . starts by defin- 



ing annuli of radius 6, 3, or 1.5 pixels around resolved stars (depending on their 
magnitude) which are then masked out. Then, as their first selection criterion, 
they considered the improvements in \ 2 of fit of a simple sinusoid over a con- 
stant baseline, and also the improvement in \ 2 f° r a Paczyhski fit over the same 
constant baseline. The actual cut then was to require that Axl m . < 0.75 A;^^,. 



However, in iBelokurov et al.l ( 120051 ) a joint variable plus lensing fit was not ex- 
plicitly employed. Many other variable sta rs were also rem oved by another cut 
in the two colour pseudo- magnitude plane (lAn et al.l . 12004 ) . 



Since t he density of variable objects in the Angstrom field is so high ( IDarnley et al. 
2007al ). (see Figure H~T1 for an example) it is considered essential to attempt to 
include the modelling of variable stars in the microlensing selection procedure 
so that it is possible to find the significant fraction of the microlensing events 
that occur which are blended with light from a variable star. Of course, multiple 
blended variables are also possible and even probable, but modelling the sum of 
an arbitrary number of variables is considered too complex to attempt at this 
stage. 

It was clear that certain classes of variable star lightcurves could be modelled 
as some kind of sinusoid function. In many known cases, the rise time of the 
lightcurve is shorter than the fall time. Therefore some kind of skew sinusoid was 
required. 
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Figure 4.1: Images illustrating the density of variable objects in the Angstrom 
field compared to the d ensity of all stars for microlensing surveys in the Milky 
Way. Image taken from lDarnlev et al.l (j2007al ). From left to right: a) LT i'-band 
image of a 2' region of the Angstrom field, b) The difference image corresponding 
to a), showing variable objects as black and white spots, c) The corresponding 
significance map, showing variable sources visible above background. Blending 
of variable sources is visible across the image. Blackened squares are masked 
out resolved stars, d) For comparison with c), an image from an OGLE-III 
Milky W a y bul ge field (OGLE image is from the OGLE Early Warning System 
fllJdalskil . l2003h for alert OGLE-2005-BLG-172). All of the objects in c) are 
variable, whereas only a handful of the stars in d) will be variable, and yet the 
density of objects is similar. 



4.2 Operation of the pipeline 



4.2.1 Introduction 



The main aim of the candidate selection pipeline was clearly to select promising 
microlensing candidates from the large number of variable lightcurves produced 
by the DIA pipeline. However, a secondary aim was to investigate the population 
of variable stars in M31, and to this end a sinusoidal function was fitted to the 
data to select those lightcurves which vary periodically and in a regular way. This 
process was complicated greatly by the number of lightcurves which clearly vary 
in an obvious way, but are not necessarily a good fit to a single sinusoidally vary- 
ing component. These might be examples of blended or multiply blended variable 
stars, or variable stars blended with microlensing candidates, or classical novae 
blended with variable stars, or some other combination of phenomena. Previous 
authors describing microlensing searches in M31 have, to a large degree, tried to 
exclude all periodically variable baselines from their selection of microlensing can- 
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didates (IBelokurov et all 120051 ; ICalchi Novati et all 120051 ; Ide Jong et all 120041 ) ■ 



It was decided at an early stage to attempt to include the variable star model in 
the selection process for microlensing candidates, in order to allow the detection 
of microlensing events closer to the core, where the probability of blending is 
higher. 

Therefore, a "combined" or "mixed" fitting function which consisted of a reduced 
Paczyhski curve added to the periodic variable function was included as one of 
the options the code could choose to fit to the data, as well as the variable 
star function or the reduced Paczyhski curve separately. During the analysis of 
each lightcurve, a five digit code was given to each one which described what 
was the general classification of the lightcurve (pure lensing, periodic variable, 
mixed, or flat), which of the cuts had been passed or failed, and what were 
the calculated signal to noise categories of both the variable and microlensing 
components. These data along with the calculated parameters for each of the 
cuts were output to enable post-analysis to be performed. A separate code was 
written which utilised this stored data to facilitate the relatively fast iteration and 
experimentation with values of cut parameters (without having to repeatedly run 
the much slower main code each time a cut threshold value was changed) and 
to roughly order the finally selected events in terms of how "well" the cuts had 
been passed collectively and hence how "good" each candidate was. This code 
performed exactly the same cuts in the same order as the main code but only 
used as input the fitting output data. Therefore its advantage was that no fitting 
had to be performed, which was the slowest operation performed by the main 
pipeline. However, if any changes to the fitting routines, or the way they were 
used, were made then the main code had to be run again. To ensure that the 
necessary data were always available to allow later experimentation with the cut 
values in a wide range, the values of the cuts used in the main pipeline were set 
to be rather loose. Certain cuts were then tightened significantly after the second 
stage, when experimentation with their values had been completed. 
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Candidate selection pipeline cuts - summary 



Cutl 

At least one significant 
bump exists 



QlL2 
Either a Paczynski or 
a mixed Paczynski + variable 
fit is the most appropriate 
fit to the data performed 



The V/d.o.f. ratio' 

of the best 
fit function over the 
next best is good enough 



Cut 4 * 

Event is not categorised 
as 'long period, 
low significance' 

f'fpr mixed events only) 



Cut 5 

X 2 /d.o.f. of global fit 
is low enough 



Data sampling of 
microlensing peak is 
good enough 



1 



Cut 7 
Event is not classified as 

long timescale', i.e. 
There are sufficient data 
points in the peak to allow the 
definition of the Mensing ratio' 



Cut 8 
Lensing Ratio 
is good enough 



No single 
over-dominant 
data point 



QHM 

Total significance 
of the data in the 
fitted peak is at least 
as good as required in 
Cut 1 to define a bump 



Cut 11 
xVd.o.f. of local fit 
to Paczynski peak 
is low enough 



£ui!2 

No significant flux 
bumps in residuals 



Figure 4.2: A graphical summary of the main cuts applied to the lightcurves in 
order to select likely microlensing events. 
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4.2.2 Summary of main actions of the main pipeline 

The main actions taken by the pipeline are summarised below in the order they 
are performed: In order of action: 

1) a) Reading in data; masking of reference image b) Sigma clipping of extremely 
small or large errors or fluxes 

2) Bump finding 

3) Bump rationalisation between telescope bands, if bands overlap in time 

4) Lomb-Scargle periodogram calculation 

5) Classification into "one significant bump" and "many bumps" and "periodic" 
or "non-periodic" 

6) Classification of the "many bumped" into "one bump significantly higher than 
the rest" or "all bumps similar in size" 

7) Calculation of best constant fit 

8) Depending on results of 5) and 6), functions fitted are either a) reduced 
Paczyhski b) skew cosinusoid c) sum of the two functions above, so called "mixed" 
Paczyhski + cosinusoid function 

9) Simultaneous calculation of "x 2 ratio" = (x 2 (A)-x 2 (B))/x 2 (A) (where A and 
B represent fits of two different functions to a lightcurve to be compared) for all 
combinations of fits performed on that lightcurve 

10) Lightcurve loosely classified as whatever the lowest x 2 fit is 

11) Calculation of x 2 per degree of freedom for best chosen fit 

12) F-test to weed out those lightcurves where the improvements in x 2 do not 
justify the number of additional parameters. These are downgraded to the next 
lowest x 2 fit- 

13) Exclusion of fits with extreme flux ratios between bands 
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14) Calculation of signal to noise estimators both for lensing and for variables. 
Classification into sub-categories for each class based on these estimators. 

15) For classifications where microlensing is involved, the same criterion for num- 
ber and significance of points in the finally fitted bump as was previously re- 
quired in 2) to define a bump was re-applied. This was necessary as sometimes 
a Paczyhski bump was fitted by the (x 2 minimising) fitting routines to what was 
basically one high point with a small error, surrounded by noise, rather than 
the intended bump found by the bump finding routines. In general there was 
no guarantee that the fitting will find the same bump as preferred by the bump 
finding routine. 

16) Calculation of local x 2 P er degree of freedom solely in the region around the 
Paczyhski peak. This was useful because the previously calculated reduced x 2 
for the whole lightcurve may be dominated by the variations in the background 
or by the poor fit to the variable part of a mixed fit. 

17) Cut on "x 2 ratio" was applied 

18) For "mixed" events: Assess whether lightcurve is "long period, low contrast" . 
(see Section 1473"]) 

19) Temporal sampling of Paczyhski bumps calculated. 

20) Cut on signal to noise and significance of points in the main bump applied. 

21) Cut on "local x 2 /d.o.f." is applied. 

22) Cut on Paczyhski bump sampling is applied. 

23) Cut looking for significant remaining bumps in the residuals to the best fit is 
applied. 

24) Selected events and useful data written out to file 

25) Plots of interesting lightcurves are created, for both variable and microlensing 
candidates. 
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The main cuts are summarised in Figure I4T21 and will be described in greater detail 
from Section 14.2.51 onwards. The numerical labelling of particular cuts used in 
Figure 14.21 is maintained throughout the sections below. 

4.2.3 Reading in data 

It is possible to request the reading of any subset of the available telescopes or 
wavebands of data through the use of a parameter array. In other words, analysis 
can be performed on any number of telescope wavebands greater than one and 
less than an arbitrary maximum of ten corresponding to the largest expected 
(number of telescopes multiplied by number of wavebands per telescope) for our 
project. In the original data files generated by the ADAP, (see Section 13. 3p . 
each data time-point is classified as being either 'CLEAN', 'CONTAMINATED', 
or 'NOERROR'. 'CONTAMINATED' data occurs when, for example, there is a 
bad pixel or a cosmic ray on the particular image being considered, within the 
PSF of the variable object for which the photometry is being calculated, which 
may make the flux calculated less reliable in an unquantifiable way. 'NOERROR' 
means that it is not possible to estimate the error for that particular time-point. 
Both the 'GOOD' and 'CONTAMINATED' points are read in by the Pipeline, 
but only 'GOOD' points are thereafter used in the calculations. The numbers of 
'GOOD' points in each band that were read in by the data-reading routine are 
counted and used throughout the rest of the pipeline. 

Reference Images 

For the reason stated in Section 13.3. li the data points associated with the reference 
image were removed from the 2007 photometry lightcurves before the pipeline 
analysed the data. For this photometry, the LT i'-band reference image was 
"LT_L20060131_012.fits", image taken on 31/01/2006. The LT r-band reference 
image was "LT_r_20050114_037", and the equivalent FTN i' and r band images 
were "FTN_L20051216_006" and "FTN_r_20051123_001". 
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Reference image masking was not required for the 2008 photometry data which 
used the combination of several good images as the reference. 

4.2.4 Using "Sigma Clipping" to remove outliers 

During the development of the Candidate Selection Pipeline, it had been noticed 
that there exist data points which have flux values and/or error values much 
greater than the average values By examining the plots of the best fits to the 
data it has also been noticed that for fits with a moderately low value of x 2 5 
the Paczyhski peak was placed at a particular to because around that time point 
there were one or two data points with higher than average flux but smaller than 
average magnitude of error bar. Hence the x 2 was being minimised by making 
the fitted function pass as close as possible to these "small error" points. 

The estimation of errors in the D.I. A. pipeline is a complicated process and it is 
difficult to prevent the occasional error bar being mis-estimated due to unlikely 
but possible occurrences in the pipeline. Therefore it was considered reasonable 
to apply sigma clipping not only to the flux values, but also to the error values. A 
subset of the images corresponding to the data points rejected were examined at 
the coordinates of the variable objects to which the lightcurves belonged in order 
to investigate whether any clear reasons existed for the "bad" data points. The 
majority of those examined were points close to the kind of major localised image 
flaws shown in the images in Section 12. 5. 1[ or just in very noisy difference images, 
but there were also in some images what appeared to be un-masked cosmic rays, 
as is almost unavoidable. 

Sigma clipping on flux values 

The discrepant flux point which occurs most frequently is most often numbered 
86th in the order of data from POINT-AGAPE field number 2, although this 
can vary minus a few if previous time points are missing. This data point occurs 
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at (JD - 2400000.5) = 51803.25. This point usually has fluxes of the order of 
thousands of units, where the mean flux is of order tens of units. In the 2007 
photometry, for example, almost all the data at this time index is marked by the 
DIA pipeline as "CONTAMINATED" , but still in a few lightcurves some data is 
classified as "CLEAN" and some of these points have either fluxes or error bars 
that are very large, as described above. It is found that setting a first cut on the 
flux at flux mean±10<7( or igi na i) for standard deviation "^"(original)" in each data band 
successfully removes this point and others like it, without impacting on the bulk 
of the data which is not discrepant. Once this first iteration has been performed, 
and discrepant points removed from the list of data, the standard deviation is 
re-calculated and the same cut performed using the new value of o. This iteration 
is repeated until the total number of points remaining becomes constant. 

Sigma clipping on error values 

Since errors on flux points are always positive in sign, a cut such as performed on 
the flux points which is symmetrical around the calculated mean will not work, as 
for some values of the multiplicative constant parameter (which has the value 10 
in the flux cut above) the lower cut would be at a negative value. The errors are 
expected to have a Poissonian distribution which has a most commonly occurring 
value near the mean value and tails on either side of this which extend down to 
zero error and upwards to infinity. In order to allow a simple cut to be made 
both on errors which are much smaller and much larger than the mean value, but 
using only one multiplicative constant parameter n, a data point was cut if the 
error value either exceeded n x MeanError or if it was less than MeanError/n. 
For example, if the constant was chosen to be 10, the cuts would be made in the 
two regions Error < MeanError/ 10 and Error > 10 x MeanError. 

Data about each point which was rejected by the sigma clipping algorithm, either 
due to a flux or error discrepancy, were written to a file along with the reason it 
was rejected and whether the limit exceeded was the upper or lower one. 
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Table 4.1: Table showing the numbers of data points rejected by the sigma clip- 
ping routine for a range of different values of the error parameter n. (The flux 
clipping parameter was always set at 10a). 



Error clipping 


Number Rejected 


Number Rejected 


Total Number of 


Parameter n 




for flux 




for error 


Data Points Rejected 




Low 


High 


Flux 
Total 


Low 


High 


Error 
Total 


Total 
(num.) 


Total 

(%) 


20 


397 


17 


414 


2081 


429 


2510 


2924 


0.182 


40 


416 


17 


433 


351 


323 


674 


1107 


0.069 


60 


426 


18 


444 


126 


214 


340 


784 


0.049 


100 


435 


18 


453 


30 


85 


115 


568 


0.035 


150 


444 


18 


462 


8 


29 


37 


499 


0.031 


200 


444 


18 


462 


5 


6 


11 


473 


0.029 



The parameter space of n was sampled as far out as n = 200, and even at 
the upper end of this range, there were still significant numbers of points which 
exceeded both the upper and lower error bounds. This seems to imply that the 
geometric nature of the use of the cut parameter n does not lead to too great an 
imbalance in the numbers of low and high points rejected. The statistics of the 
total number of points and their breakdown into "too low" and "too high" were 
analysed for both flux and error clipping, using only the first 2000 lightcurves (to 
speed up the process somewhat), and are presented in Table fl~Tt 

It can be seen that the numbers of points rejected for flux discrepancies varies 
with the value of n, even though the equivalent parameter for flux clipping was 
constant at 10. This is because if, for example, the error cut parameter n is set 
higher, then the number of data points rejected in the first iteration is lower, 
which means that more data points remain to be considered for the flux cut in 
the second iteration. In many cases, if a point has an overly large error it also 
has a proportionately large flux value, and so is rejected on the basis of flux in 
the second iteration whereas previously it was rejected on the basis of error in 
the first iteration. The number of points rejected on the basis of flux became 
constant for larger error clipping parameters than 150, which indicates that there 
are no more points with high flux magnitudes which have errors which are more 
than 150 times the mean error. It is clear that the final value for the number 
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of points rejected will converge to 462 if n is increased much further as almost 
no points are rejected on the basis of error when n is as large as 200. The total 
number of data points contained in the 2000 lightcurves was 1, 605, 674, which is 
divided into the total number of points discarded to give the percentages of the 
whole in the final column. 

4.2.5 Rejection of lightcurves with insufficient data 

The number of "good" data points in each individual telescope waveband are 
totalled together. If this number is greater than a threshold value, set at 25, then 
the pipeline proceeds with the analysis, else it "fails" the lightcurve and moves 
on to the next. 

4.2.6 Iteration over residuals 

The pipeline is designed to find the best fit to the data, using functions repre- 
senting constant flux, periodic variable stars, point lens point source microlensing 
or an additive combination of lensing and variability. When this fit has been de- 
cided upon, the residual of the fit is calculated. The pipeline then iterates by 
examining the residual in a very similar way as it did the original flux data, up 
to a maximum number of degrees of residual, currently set to be three. Iteration 
over residuals for a particular lightcurve is cut short when: 

1) The x 2 °f th e best fit function increases from the value found by the previous 
iteration, or if 

2) The x 2 g° es below the value of the parameter for the cut on global reduced x 2 
(Cut 5) (see below, Section FOj) . In other words a sufficiently good fit has been 
found, or if 

3) A constant is the best fit to the lightcurve, or if 

4) No bumps (as defined below, in Section I4.2.7P were found in the current iter- 
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ation's residual data. 

4.2.7 Bump Finding 

First iteration (flux data) 

The first process performed on the data is always to search for "bumps" in the 
data. That is, coherent areas in time within the data which show a positive 
deviation with respect to the baseline of the lightcurve. On the first iteration the 
data are always examined using a routine referred to hereafter as "bump_find" , 
which uses a point-by-point counting algorithm. This is done on each telescope 
band in turn, and a count is kept of the number of bumps found in each band. 
The first action performed is to estimate a baseline for the lightcurve. This is 
achieved by calculating the five point moving average for the lightcurve and then 
taking the minimum value of that function. Specifically five points were chosen 
as being a reasonable compromise between being sufficiently localised to react 
to short term downward fluctuations, but not so localised as to prevent some 
smoothing over noise in the lightcurve. Next, the lightcurve is examined point 
by point looking for positive deviations above this baseline value. A parameter 
fdif — {Ft ~ B)l {Ei), where F t is a flux point, B is the calculated baseline and 
is the error bar on the flux point, is calculated for each point %. If this number 
goes above 3, i.e. 3<7j for any point then the count of points n p in the possible 
bump begins. If f di f goes above 5 then a separate count of "high signal to noise" 
points n h is begun. While a bump continues, the total "significance" of the 
bump is calculated by summing /<&/ quadratically if fdif is positive. If a point 
drops below fdif — ^ then a count of non-significant points n m is started. If this 
number reaches 2 when n p > 5 and the > 3 and the points are consecutive 
then the bump is simultaneously aborted at that point and counted as a "real" 
bump. This combination of significances guarantees that any bump found is at 
least 8.77c above the baseline. An additional prescription is introduced largely 
to mitigate a perceived inconsistency in the logic caused by the "n m = 2 to end a 
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bump" condition. It is possible for a bump to be begun and then for a single point 
to have such a large negative /<&/ that the totalled fdif for the bump becomes less 
than zero, but without triggering the n m = 2 condition. If the new point still has 
< fdif < 3, i.e. is positive but not large enough to be considered "significant 
enough" then nothing unusual happens. Its fdif is added quadratically as before. 
Hence even points which are considered "not high enough to be part of the bump" 
can still contribute in a small way to the total significance. If the new point has a 
negative fdif and subtracting the /J^ of the new point number % from the current 
total of f\\f of the bump so far would not make the total Tifdif/ up to the i th 
point in the bump negative then this is done, by 



Running total of bump significance = = yS/^^i) 2 - f dif / (4.1) 

Thus the negative f d if data point reduces the significance of the bump but does 
not end it. However, if calculating fdifu-i) ~ fdif,i resu hs in a negative number 
and there have been insufficient points with positive fdif then this is classed as 
a "false start" and the bump is aborted without being counted as "real". The 
start and end of the bump are set to the first and last points where 3 < fdif- The 
routine also returns the total \ 2 °f each bump, and an estimate of the central 
time of the bump calculated by a mean weighted by /JL^ for all the time points 
% in each bump. 

Subsequent iterations (flux residuals) 

In situations where a "good" fit has been found between the applied model and 
the data, it is expected that any remaining significant coherent deviations from 
zero will be much less obvious than those in the original data, "good" here means 
a fitted x 2 /d.o.f. which is reasonably low, but not low enough to pass Cut 5 (see 
Section fl~3]) . Therefore on subsequent iterations, which attempt to find remaining 
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variations in the residuals to the first fit, a different routine is used. This is 
designed to detect the one most significant deviation (bump) in the lightcurve. 
This is mainly aimed towards detecting microlensing events which may have been 
overwhelmed on the first iteration by a more obvious variation (likely to be stellar 
variability), but which can be detected after this first order variation has been 
subtracted. This algorithm works by calculating the fdif.i of each point i in the 
lightcurve and ordering them with a sort routine so that the most significant 
deviation is at the top of the list. Next it looks for a cluster of five or more points 
at the top of the list which are consecutive in time order, with the exception that 
there can be one "outlier" interposed within this group. If such a group is found, 
the search is extended to the next point on the list, which is either added to the 
bump if it is contiguous with it or not if it is not. The bump is extended until no 
more points can be added. The jd%j,i is used as the determinant of significance 
because if /J^ were used a large negative deviation from zero residual would be 
treated in the same way as a large positive one, meaning that roughly half the 
time it would subsequently be attempted to fit a negative dip in the lightcurve 
with a positive Paczyhski curve. In addition, residual lightcurves with a lot of 
"noiselike" variations where consecutive time-points are often alternately positive 
and then negative would commonly be detected as having bumps in /j^j when 
this in reality only shows high variability, not a positive signal. The central peak 
time is calculated using a fd% /^-weighted mean of all the time points in the group. 
The width of the hypothetical Paczyhski peak is estimated from a fdif,r weighted 
mean difference from the central peak time calculated from all the points in the 
bump group. 

4.2.8 Bump Combining 

Because the data analysed are contained within several separate telescope wave- 
bands, which could not be directly combined due to lack of good knowledge 
about the relative flux scaling factors, it is possible that one physical bump in 
the lightcurve may have been detected in several different bands at once. There- 
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fore in order to correctly assess whether the lightcurve can be characterised as 
having "one bump" or should be better classed as "many bumps" , it is necessary 
to decide how to combine bumps which overlap in time in different bands. There 
are several ways of doing this, but the more conservative convention adopted for 
this work was that two bumps in different bands should only be combined if the 
boundaries of one bump are completely contained within the boundaries of the 
other. Two bumps which overlap partially at one end are allowed to remain as 
separate entities. A hypothetical example of this is shown in Figure 14.31 Three 
bands of data are shown with an initial total of six detected bumps. The start 
and end of each bump are shown by the extent of the horizontal brackets. The 
significance of each bump is illustrated by the thickness of the horizontal line, 
more significant being thicker. It can be seen that after the combination process 
has been completed, the two bumps in Bands 1 and 2 which were latest in time 
have been combined into one larger bump in the bottom panel. The pipeline 
simply adds together (quadratically) the S/J^j values of two combined bumps to 
show that there is more evidence for the combined bump due to it having been 
detected in two bands. The end points of the combined bump are calculated by 
finding the average of the two initial bumps' end points, weighted by their initial 
^fdifi va hies. Hence the boundaries of the final bump will be closer to those of 
the more significant original bump. These bump width values are only used as 
starting points for the MINUIT (CERN, 2006) minimisation to follow and so need 
only be approximate. The combination of bumps occurs serially, so that bumps 
considered later are compared with the new combined end points of any bumps 
already found to overlap, rather than trying to find "nests" of bumps which all 
lie inside one another from the totality of bumps and combining them all at once. 

It has been considered that this algorithm might have problems if the beginning 
of a physical bump is detected in the data of one band, which then has no more 
observations, and the end of the same bump is then detected in observations 
beginning in a second band. In this case, if the data in the two bands does 
not overlap or overlaps only partially, one physical bump might be classified as 
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Band 1 



Band 2 



Band 3 



Combined 
result 



Time 



Figure 4.3: An example of how the bump-combining process works. The three 
bands of data shown in the top three panels are analysed to find any pairs of 
bumps which lie completely inside one another. These are then combined into 
one by adding the values and finding the H/j^ -weighted average of each 

of the end points. The boldness of the lines represents the relative significance of 
each bump. 

two. However, since LT and FTN data are collected simultaneously during the 
observing season, this situation should only occur at the boundary between PA 
data and Angstrom data, or alternatively for very short timescale bumps which 
insert themselves between clusters of data in different bands. Even in the instance 
of this occurring, the most serious consequence for the pipeline would be that a 
mixed (Skew-Cosine + Paczyhski) fitting function would be used instead of a 
simple Paczyhski fit (see Section 14. 2. lip . This should not be too serious, as 
either the Paczyhski or the "Cosinusoid" function, or both, are free to vary their 
amplitudes down as low as zero and so, practically, can regain the fit that should 
ideally have been found. 
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4.2.9 Periodogram Analysis 



After the number of bumps in the lightcurve has been established, A p eriod 



finding routine which implements the Lomb-Scarg 



1975 



Scargle 



1982 



Shobbrook and Lomb 



1972 



e Periodogram a 



gorithm fjLomb 



Shobbrook et al 



1972|) is used 



on each band of data separately to find significant periods. Currently, 1 million 
frequency divisions are used which was a number merely chosen large enough 
(since this was a computationally fast algorithm) to ensure the smallest step in 
period was sufficiently small to be unimportant, especially since the periods found 
are only used as "initial seed" values for the fitting. From the periodogram, pe- 
riods are first selected which are local maxima and then, the most significant 
periods are chosen using a cut of Periodogram(frequency) > 30. 

The period range searched by the periodogram began at 0.5 days for the analysis 
of the 2007 data, but was reduced to 0.01 days for the 2008 data. The highest 
period searched is 1.5 x (t max — i m i n ) where t max is the latest data point for the 
particular lightcurve and t m i n is the earliest data point. A multiple of 1.5 is 
chosen based on the logic that for a period as large as twice the range of the 
data At = (t max — t m i n ), a variable star might not be reliably distinguished from 
a Paczyhski curve, whereas a slightly shorter period of 1.5 x At should have a 
noticeable rise at one end or the other above the hypothetical Paczyhski baseline, 
if there is sufficient data coverage. 

The number of significant periods found, including when zero periods are found, 
along with the number of bumps found, is used to make further decisions about 
what fitting operations to perform on the lightcurve. This is further described in 
Section OTT1 



4.2.10 Fitting functions used 



The CERN MINUIT (CERN, 2006) (originally written bv l.Tames and Roosl f ll975f n 
software was used to minimise the y 2 of the fit. Where possible the allowed ranges 
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of parameters were left unconstrained as this increases the efficiency of the search 
algorithm, but since MINUIT is a "local minimisation algorithm" and not de- 
signed to find "global" minima, it was also found to be important to pre-calculate 
guesses for the starting values of parameters as well as possible. The particular 
fitting function used in all fitting routines was "MINI". This in turn calls the 
MIGRAD routine, but switches to the SIMPLEX routine if MIGRAD fails. MI- 
GRAD uses variable- metric (also know n as "quasi Newton") algorithm, based 



upon the algorithm of iFletcherl (I1970I ). with a stable metric updating scheme, 
and checks for positive-definiteness of the calculated covariance matrix, because 
the search directions determined by MIGRAD are guaranteed to be downhill only 
if the covariance matrix is positive-definite. This algorithm estimates the matrix 
of second derivatives (the "Hessian") of the defined "quality function" with re- 
spect to the various parameters and uses the inverse of this matrix to iterate the 
parameter values towards the minimum of the quality function. Thus the com- 
puting cost will rise as the size of the matrix that needs to be inverted rises. The 
matrix involved is of size nxn, where n is the number of fitted parameters. The 
stated main weakness of this algorithm is that it depends heavily on knowledge of 
the first derivatives, and fails if they are very inaccurate . The SIMPLEX routine 



is based upon the simplex method of iNelder and Meadl (119651 ). 



Attempts were made to reduce the number of fitted parameters to a minimum, 
in order to increase the efficiency and reduce the required computing time of 
the fitting, and consequently the overall running speed of the program. To this 
end it was possible to link the fitted amplitudes of LT i'-band and FTN i'-band 
difference image data and also those of LT r-band and FTN R-band, thus reducing 
the number of parameters by one when each of these pairs of data sets exist 
and are used. The average ratios between these telescope bands were found by 
comparing many small equivalent areas of an image for each telescope. The ratios 
found were 

FluX LT y =F1UX ratiO( LT i'/FTNi') x FluXpxNi' 
and FluX LTr =F1UX ratiO(LTr/FTNr) x FluXpTNr 
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where the constant ratios between the difference fluxes in bands assumed were 
Flux ratio(LTi'/FTNi') = 1-081 and Flux ratio(LTr/FTNr) = 0.305 

The fitting code can either be used with or without maintaining a fixed scaling 
factor between the flux amplitudes in the PA and (LT and/or FTN) bands. The 
first links the PA flux amplitude to the LT and/or FTN flux amplitudes using 

FluxpAi' = ^x^oLTl/FTNi'' x FluXLTi ' and nence Flux PAi' =Flux ratio(p Ai /FTNi) x 
FluxpTNi'- The second uses the flux amplitudes in PA data as variable fitting 
parameters and does not assume any knowledge of the value of the flux ampli- 
tude ratios between PA and LT or FTN. The advantage of linking the PA flux 
amplitude to the other bands used is that fewer fitting parameters were required, 
which simplifies the parameter space used, meaning that the code can run faster 
and more efficiently. However, for initial investigations of the flux ratio between 
PA and LT and/or FTN data the "unlinked" version had to be used, for obvious 
reasons. Using the "unlinked" version also has other disadvantages in that the 
fitting routines are free to find any values of flux amplitude which have the ef- 
fect of reducing the overall x 2 ? which can occasionally lead to unphysical ratios 
between waveband flux ratios or strange combinations of Paczyhski and Sinusoid 
parameter values. These solutions may improve the fit to a particular data set, 
as measured by x 2 5 but at the expense of flux amplitudes and hence flux ratios 
between bands which could never occur in reality. Thus additional prescriptions 
and safeguards had to be introduced (which are described below) to minimise the 
number of these "wild" fits which pass through the selection cut system. 

The reduced Paczyriski function for microlensing fits 

A reduced Paczyhski curve is fitted in which the flux varies with time as, 

F(t) = B + AF/y/1 + 12[(t - t )/t FW HM] 2 (4.2) 
where t and t are the epochs of observation and maximum brightness, B is the 
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baseline flux, AF = F(t ) — B is the maximum flux deviation and £fwhm is the 
duration of the full-width at half-maximum. 

The initial conditions for the fit i.e. the to an d the estimation of the order of 
magnitude of £fwhm are primed by the output obtained by bump_find from the 
width and position of the (largest or only) bump. In order to exert some control 
on Paczyhski peaks with huge amplitudes being fitted with t 's in gaps in the 
data, (attempting to minimise the x 2 to an upwardly deviant point near the edge 
of the data immediately before or after the gap), the Paczyhski amplitude was 
limited to 100 x (Flux max — Flux min ). 

The Skew Cosinusoid Function for variable stars 

The skew cosinusoid function chosen, which was based on oc t s , was designed 
to be capable of being adjusted by the fitting routine continuously from zero 
skewness (a simple cosine curve) to high skewness where the rise time is much 
shorter than the fall time. 

In this model, the phase rises from zero initially as t s , where S is the skewness 
parameter. After the middle of the period, at P/2, it rises the rest of the way to 
4> = 2ir as the double reflection around = n and t = P/2, i.e. 0(t) initially has 
an increasing gradient, but after half the period has a decreasing gradient back 
down to a gradient of zero at t — P, — 2ir. For a skewness of 1, this function 
reduces to = t and has smooth gradient changes over the whole period, as 
required. Since the rise of this function steepens rapidly with increasing S, in 
order that the function be continuous at = n and t = P/2, the function must 
be normalised to equal tt at this point by dividing by 

For a practical reason, (since the zero phase of a Cosine function occurs at the 
peak of the function and bumps in lightcurves are being searched for) a Cosinusoid 
was actually used instead of a Sinusoid. To model a variable star lightcurve, 
parameters required are : the amplitude of the cosinusoid, F max , the baseline flux 
B, the zero phase of the cosine O where phase = -p2n and the phase function 
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A. The temporal variation of the flux F is represented by: 



F 



cos(A — (j) ) + B 



(4.3) 



The phase function A is defined by: 



A 




is 



(2tt - 4>) b - f if vr < < 2tt 



7T 



2 



if < < 7T 



(4.4) 



The skewness parameter, "S" 

The cosinusoid function was originally conceived varying in the range 1 < S. 
In this range, for relatively moderate values of the skewness parameter, a point 
of inflection, occurring at <fi = ir, begins to be visually obvious around 1.5 < 
S < 2. By choosing examples of lightcurves that were deemed "acceptable" 
or "unacceptable", the upper limit for S in the fitting routines was reset to be 
S < 1.574 which corresponds to the rise time being a fraction of one period 
of 0.356. This was the limit used for all fitting on the 2007 data, including 
the investigation of variable stars (Section 15.5.1)) . During later development it 
was found that S could also be allowed to be less than one and still produce 
a physically useful skew cosinusoid function. In order to get a skew cosinusoid 
which has a rising portion which is shorter in time than the falling portion (as 
in many variable stars), when using S > 1, the sign of F var in Equation 14.31 must 
be negative, but if S < 1 then the sign of F var must be positive to achieve the 
same end. When S < 1 the skew function is smoother and has a different shape 
than when S > 1 and does not suffer from the function becoming no longer 
smooth as the value of the skewness parameter becomes lower. In the fitting 
algorithms these conditions are imposed by formally limiting the ranges of the 
fitting variables used in MINUIT. The maximum skewness used in the fitting 
corresponds to the rise time being > 5% of the total period, which is about the 
most extreme seen in variable stars. The equivalent limit in the skewness when 
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S < 1 is 0.02314 < S < 1. With the above function, the fraction of the period 
taken by the rise time is given by Equation 14.51 Mathematically this second S 
limit is equivalent to using = 0.95 in Equation (j4.5j) . as when the function is 
reflected when the flux amplitude changes sign, the rise time then becomes the 
fall time and vice versa. Both possible regimes of the function, i.e. 

0.02314 < S < 1, F w > and 1 < S < 1.574 , F var < were utilised in the 
fitting to the 2008 data. 

Some examples of the skew cosinusoid function, plotted for varying values of 
skewness parameter, are shown in Figure 14.41 

The to of the largest bump in the lightcurve, found from the bump-finding routine 
is used to define the starting point for the fitting of the zero phase of the cosine 
function. 

If the periodogram has been calculated and produces any significant periods, then 
these are used as initial conditions for the Skew Cosine part of the fits. Otherwise 
when either the periodogram is not used, or if it is used and does not find any 
periods for a particular lightcurve, the period range between 0.5 to 1000 days is 
equally sliced in log(P). In both this function and the joint fit below, MINUIT 
limits on the fitted period are only defined (to give the size of the required P 
slice) if P is sliced, but allowed to vary freely if the periodogram is used. For 
the 2007 photometry, 52 overlapping slices are used to avoid having touching 
slice boundaries and to give 50 slices covering the actual region of interest, but 
this is reduced to 6 for the analysis of the 2008 photometry in order to reduce 
computation time. 
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Skew Cosinusoid Function plotted for several different values of Skewness 
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Figure 4.4: Plot showing examples of the skew cosinusoid function used in the 
candidate selection pipeline for varying skewness parameter. The skewnesses 
shown are 0.2314,0.50,0.75,1.00,1.25,1.574 and 1.75. The first three, which have 
S < 1, are shown in blue- in practice these would be effectively time-reversed by 
using a negative amplitude. The next two (1.0,1.25) are shown in green. The 
upper limiting value of 1.574 is shown in amber, and a value of 1.75 which is 
considered to have too extreme a point of inflexion to be usable is shown in red. 

The Joint Paczynski + Skew Cosinusoid Fit Function 

This consists of a simple sum of the reduced Paczynski and Skew Cosinusoid 
Fit functions described above. It was necessary to include this kind of function 
because Angstrom is observing close to the M31 core, where stellar crowding is 
a serious problem. Therefore a large fraction of variable objects contained some 
kind of detectable contribution from one or more variable stars. If a "traditional" 
cut specifying a requirement for a flat baseline outside the lensing peak had been 
used, many events in our fields would be discarded. 
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4.2.11 Selecting which functions to fit 



Based on the output of bump_find (or chLbump) and the Periodogram, the 
Pipeline decides which functions are most appropriate to attempt to fit to the 
light curve. 
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Figure 4.5: Flow-Diagram showing which fitting functions are used given par- 
ticular attributes of the lightcurve such as a) number of bumps found (if any), 
b) dominance of one bump over any others, c) periodicity or lack of it. On the 
bottom (green) level the fit functions used are designated as P = Paczyhski, SC 
= Skew Cosine, PSC = Paczyhski + Skew Cosine. On the second level from 
the bottom, "L-S P" refers to the value of the Lomb-Scargle Periodogram at the 
period of interest. 



In all cases the best fit constant value is calculated along with its \ 2 . This is 
then compared to the more sophisticated fits used. There are several layers of 
filters in the logic tree, and this is more easily shown in its entirety by Figure 14.51 
However, the basic philosophy is that if the lightcurve has only one bump then a 
Paczyhski fit is performed, along with the Skew Cosine function for comparison. 
If the lightcurve has more than one bump, then it is likely that a Skew Cosine 
function is a possible fit, so this fit is performed. However, if one of the bumps is 
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significantly "more prominent" than the rest, then the possibility of microlensing 
superimposed on a variable star is considered, and a joint (Paczyhski + Skew 
Cosine function) fit is also performed. 

For the definition of "more prominent" currently chosen see Cut 2 in Section 14.31 
below. 

4.3 The Cuts 

Cut '0' Rejection of lightcurves with insufficient data 

The number of "good" data points in each individual telescope waveband are 
totalled together. If this number is greater than a threshold value, set at 25, then 
the pipeline proceeds with the analysis, else it "fails" the lightcurve and moves 
on to the next. 

Cut 1 At least one significant bump exists 

In order to be a possible candidate, a lightcurve must have a coherent area of 
the lightcurve in which the difference flux is significantly higher than either the 
rest of the lightcurve in the case of a sufficiently flat baseline, or at least, if the 
variability is sufficiently high, its immediate neighbourhood in time. Therefore 
an algorithm was constructed (see Section 14.2.71 above) which identifies "bumps" 
in the lightcurve. If no such bumps are identified, the lightcurve is classed as 
"constant" , and no further fitting is performed, meaning that it cannot be either 
a lensing or variable star candidate. 

Cut 2 Microlensing is required for the best fit to the lightcurve 

When the x 2 /d.o.f. of whichever fits were performed on a particular lightcurve 
are compared, to be selected as a lensing candidate : a) Either a Paczyhski or a 
mixed fit must have been fitted to the lightcurve and b) One of these must have 
the lowest xVd.o.f. fit of all fits performed, c) An "F-Test" (see Section P4.4. 1|) 
must decide that either a Paczyhski or mixed fit is the most appropriate fit, given 
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the numbers of fitting parameters. 

For condition a) to occur a criterion must be satisfied that is designed to look for 
lightcurves with one bump which is more prominent than the others, and which 
would therefore be more likely to be well fitted by a joint fit. 

The most prominent bump is compared to the second most prominent bump in 
two separate statistics, namely the significance of the bump as defined by the /<&/ 
values (Section 14.2. 7p . normalised by the number of points in the bump n p , and 
the largest f&f component due to a single data point in each bump. (This looks 
more for brief but sharp flux spikes). The final criterion was: If the number of 
bumps found was more than 1 and either 

where subscripts 1 and 2 refer to the most significant and second most significant 
bumps respectively, or, 

fdif,i,l > 1-5 fdif, i,2, 

where u fdif,i" is the largest value of fdif,i within the bump for a single data point 
i, then a mixed fit was fitted in a) rather than a Paczyhski fit. 

If there was only one significant bump in the lightcurve, then a simple reduced 
Paczyhski curve was fitted. 

Cut 3 Selection on \ 2 difference ratios 

In order to be selected as a candidate, either as a microlensing event or as a 
variable star, a lightcurve must pass another cut, which is designed to ensure 
that the chosen best fit is clearly better than the next best. The best fit function 
is only selected as a candidate if the percentage difference between x 2 /d.o.f. of 
the best fitting function and the second best fitting function is greater than some 
threshold value. Equation 14.61 shows a 10% reduction in x 2 /d.o.f. The quantity 
which must be evaluated to assess this is referred to as the "x 2 difference ratio" , 
or "CDR" and it is defined between competing fits A and B, 
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CDR = ^/d-o. f - Xi/d.o.f > Q A 
Xi/d.o.f 



The threshold chosen for the final selection in this work was 0.4. 



This cut is similar to the first cut used by iBelokurov et al.l (120051 ) except that 
in my analysis the joint fit is also performed and possible lensing events can be 
classified either as "mixed" or "Paczyhski" , depending on which fit has the lowest 
X 2 /d.o.f. 

Cut 4 "Long period, low contrast" mixed events 

For "mixed" fits, it is harder to be confident in the fit to the lensing part of 
the "mixed" fit if in turn we are less confident about the variable part. One 
circumstance which may reduce our confidence in the variable part of the fit is 
when the period of the variable is very long compared with the time span of the 
data for a particular lightcurve. Then the fitted variable function may not repeat 
sufficiently, and it is not possible to say for certain whether the behaviour of the 
lightcurve would actually remain the same outside this region, if the data existed. 
Therefore, it is demanded in the case that the period is shorter than half the time 
span of the data (the variable does not repeat fully at least once) that the ratio of 
the fitted Paczyhski amplitude and the fitted variable amplitude is greater than 
5. In the case of the period being longer than this criterion, a secondary condition 
is applied on the amplitude ratio, described below. 

The area of parameter space in which the Paczyhski peak and the peak of the vari- 
able component may most easily be confused is when the time span of the peaks 
is of the same order of magnitude. This occurs roughly when period/2 ~ tpwHM- 
Hence in this region a higher ratio is demanded between the fitted Paczyhski and 
variable amplitudes in order to avoid classification as "long period low contrast" . 
Outside this region, a lower criterion on this amplitude ratio is demanded. The 
central region surrounding the above condition is chosen as £fwhm < period < 4 
^fwhm, and the resulting dual conditions which must be satisfied for a lightcurve 
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to pass the cut are: in the region ^fwhivi < period < 4 £fwhm : F pa _ c /F vllT > 5.0, 

where F pac is the flux amplitude of the fitted Paczyhski peak and F var is again 
the flux amplitude of the variable component, 

and in the surrounding regions, period < tpwHM or period > 4 tFWHM, 

-fpac/ -^var > 0.5. 

Lightcurves failing any of these conditions are labelled as "long period, low con- 
trast" events and are saved in a separate file. The original motivation for de- 
veloping this cut is described in greater detail, with an example lightcurve, be- 
low in Section 14.5.21 This cut has a similar effect to the sixth and final cut of 



Calchi Novati et al. 



(2005) 



Cut 5 Selection on reduced % 2 

To ensure that only good fits to the data are passed, a cut is made on the reduced 
X 2 of the data relative to the best fit function. If all the physical variation in the 
lightcurve is entirely modelled by the fitted function, this number would be ex- 
pected to be of the order of 1. However, experience gained modelli ng a particular 



lightcu rve found by the Angstrom Project Alert System (APAS) (IDarnley et al. 



2007al ). showed that in our data there is much variation of the "background" to 
the objects which cannot always be modelled by simple one or two object fits 
such as described here. This may be caused by the high density of stars in the 
M31 bulge causing additional variable objects to be unavoidably blended in with 
the PSF of our studied objects. Therefore a preliminary loose cut is imposed in 
the main pipeline of a value for the reduced \ 2 < 15. Tighter cuts on reduced 
X 2 are imposed in the subsequent second part of the analysis using the post-run 
analysis program. 

Cut 6 Sufficient points in central peak 

For lightcurves fitted with either a reduced Paczyhski curve or a combined Paczyhski 
+ Skew Cosine fit, a cut is imposed to ensure only candidates which have suffi- 
cient data points in the central region of the peak are allowed through. Without 
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this cut, many Paczynski peaks would be fitted with peaks sitting in gaps in the 
data. Also, it was found that this cut could be used as one of the strongest ways of 
preventing classical novae being selected as (particularly short timescale) lensing 
events. One of the lightcurve examples which informed the evolution is described 
below in Section 14.41 A difference was made in the treatment of Paczynski peaks 
with values of tFWHM above and below 2 days. This stemmed from the desire to 
only select fast-rising lightcurves if there was enough data in the early and late 
rising and falling parts respectively of the curve (i.e. the "wings" of the curve) 
to assist in distinguishing between fast novae and microlensing, which can look 
quite similar given sparse sampling around the time of peak flux. Hence a stricter 
cut was applied for shorter timescale lightcurves in the wing regions (defined as 
between tFWHM/2 and 2£fwhm away from the central to). 

For lightcurves with tpwHtvi < 2 days: 

Within the time range (to ± f " HM ) number of data points > 1. 
Within the range (to ± ' FW 2 HM ) number of data points > 3. 

Within the range (t + *e^hm) < t < (t + 2t FWHM ) number of data points > 2. 
Within the range (t - 2t FwnM ) < t < (t - *™ HM ) number of data points > 2. 

For lightcurves with tFWHM > 2 days: 

Within the time range (to ± '™ M ) number of data points > 1. 
Within the range (to ± ' FW 2 HM ) number of data points > 3. 
Within the ranges (t + tssatx.) < t < (t + 2t FWHM ) 
OR (t - 2t FW HM) < t < (t - SEsmi) number of data points > 2. 

So to summarise: for timescales less than or equal to 2 days, a minimum of 
(1 + 3 + 2 + 2) = 8 data points are required inside the region (to ± 2tFWHM) • In 
addition, there must be 4 points in the central region, and at least 2 points in 
each of the "wings" . 

For timescales greater than 2 days, a minimum of (1 + 3 + 2) = 6 data points are 
required inside the region (t ± 2t F wHM)- In addition, there must be 4 points in 
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the central region, and at least 2 points somewhere in one or both of the "wings" . 

The specification in both cases that there must be one point in the very central 
region (i ± ' FW 4 HM ) as well as 3 in the less central region (i ± tpw ™ ) is an attempt 
to reduce the instances of clumps of three points being very close together in time 
followed by a data gap precisely where the peak is. Data tend to be clumped in 
this way as, for example, during good weather conditions many images can be 
taken. 

Cut 7 The lensing timescale 

Definition of the signal to noise parameter as defined for lensing candidates is 
discussed in more detail below in Section 14.3. 2\ but in summary, any lightcurve 
where there are fewer than 10 data points outside the peak region t — 2£fwhm > 
t > to+2tFWHM cannot have the signal to noise of the peak assessed by the method 
used for the majority of lightcurves. Therefore these lightcurves are not included 
in the main group for selection, although with the addition of more data points 
after the peak, they may yet be included in future. The lightcurves which fail this 
cut will be henceforth referred to as "long timescale" , as in the majority of cases 
when there are insufficient data in the baseline to define a standard deviation this 
is because the width of the £fwhm found in the fitting is comparable to or greater 
than the time spread of the data. Conversely, those that are not rejected at this 
stage are referred to below as "normal" candidates. 

Cut 8 Sufficient signal to noise 

For each lightcurve which has been selected as a lensing candidate by passing 
all the cuts described above, and also for each lightcurve classified as a variable 
candidate a "signal to noise factor" is calculated that quantifies how clearly the 
desired signal is seen above the noise. Different routines are used for variables 
and for lensing candidates, and these are described below. 
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4.3.1 Variables 



For variable candidate lightcurves, first the mean error of all the points is calcu- 
lated, along with the mean flux in each band and the standard deviation of the 
fluxes in each band from this mean value. Next, if the number of points in each 
band is greater than 10 (to ensure that the standard deviation value is reliable), 
then the mean value of the ratio R given by 



0"flux,band y\ 



E 



band 



is calculated for each band, where Eb an d is the mean error in each band, and 
0flux,band is the flux standard deviation in each band. Then, the weighted mean of 
the ratio values for each band (in which a standard deviation could be calculated) 
is calculated, to find a mean value of the ratio for all data points, as if all fluxes 
had the same flux scale. Hence, variability in bands with more data points have 
a greater weight in the final result. 

The final quantity calculated can be represented by 

5 var = (4.8) 

where k is the number of data bands with a number of points n» greater than 10 

This quantity S va _ T , referred to as the "variable signal to noise ratio" gives a value 
to the size of the variable signal relative to the expected noise as quantified by 
the error bars. The reasons for this definition are described in more detail later, 
in Section 14.6.11 

For a lightcurve which was completely random, and hence distributed in a Gaus- 
sian manner, it would be expected that on average 68.2% of points should lie 
within E of the mean, where E as above equals the mean error of all the points. 
Hence the ratio of the standard deviation of the points to the mean error should 
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be ~ 1. Therefore, any non random lightcurve i.e. one which has, on average, 
variations greater than the size of its error bars will have a ratio greater than 1. 
Hence a value > 1 is required by the pipeline for a lightcurve to be selected as 
having a detectable amount of variation above the noise. Lightcurves are graded 
from to 9 on the basis of this quantity, the top of the ninth category corre- 
sponding to a signal to noise parameter of 30, and the category divisions being 
linear and equal. It would be possible to impose a lower cut at a higher value 
than 1, but the investigation of the statistics of lower signal to noise variables is 
included in this work. It should be remembered also that a variable lightcurve is 
only fitted to the data if at least one significant period has been found in the data 
by the Lomb-Scargle periodogram algorithm. Thus truly random lightcurves are 
eliminated from the variable sample and a higher percentage of true signals would 
be expected. 

4.3.2 Lensing Candidates 

For lensing candidates, one of the desired qualities which must be quantified is 
the effective signal to noise of the lensing peak measured above the fitted baseline 
flux. Obviously it would be impossible to estimate the noise level by using all 
data points in the lightcurve, as these would include the candidate signal. Hence 
a region is defined outside the main peak, where the modelled lensing signal is 
insignificant, in which to estimate the background noise in the data. This region 
is chosen as being t — 2£fwhm > t > t + 2£fwhm- Initially, each band has 
its fitted baseline offset or variable component removed, so that the baseline is 
zero. For the lightcurves which are modelled as a joint Cosine + Paczyhski fit, 
the Cosinusoidal part of the fitted model is first subtracted to leave only the 
lensing peak, before the above calculation is performed. The variable signal to 
noise ratio of the Cosine part of the fit is then also calculated. The parameter 
quantifying the signal to noise of the event is then calculated by finding the mean 
value of the quantity fdif (see Section 14.2.7ft using all data points within the 
central Paczyhski peak region to — 2tFWHM < t < t + 2tpwHM- The ratio between 
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this mean f di f level (the "signal") and the standard deviation of all data points 
outside the peak region, where the lightcurve should be almost flat, (the "noise"), 
is then taken. This quantity, referred to as the "lensing signal to noise ratio" , or 
"lensing ratio" (or LR) for short, shows how many "sigma" above the baseline is 
the mean measured flux in the peak. If there were no lensing signal at all, this 
quantity would be zero. Objects were initially defined as having sufficient signal 
to noise for the initial selection using the most relaxed cuts if the lensing ratio 
> 2, which allowed examination of a greater selection of lightcurve fits, but in 
the final selection a higher value is required. Of course, for the same reasons as 
in the calculation of the variable signal to noise ratio, the standard deviation of 
the non-peak region is not calculated using low numbers of data points, so if the 
number of points in this region is < 10 then the lensing signal to noise ratio is 
not calculated. This situation will be more likely for candidate events with large 
^fwhm, as there will be fewer data points in the lightcurve which are not part 
of the peak. Thus only allowing through events with a defined value of lensing 
signal to noise ratio would remove some perfectly valid long timescale events 
and hence bias the final sample. Therefore, lightcurves for which this quantity 
cannot be defined were not entirely thrown away. This class of lightcurves have 
therefore been subjected to one fewer cut than all the others and so are kept in a 
separate output file. In this group there may be a small number of valid lensing 
candidates, but there are also a large number which have fitted amplitudes and 
hence signal to noise which are too low. To prevent lightcurves in which a long 
period variable star could mimic a long timescale lensing event from being allowed 
through as easily, the level of Cut 3 which is applied to these lightcurves is higher 
than the initial loose cut applied to all others; a value of 0.1 instead of 0.01. The 
lightcurves with "defined but too low" lensing signal to noise ratio or undefined 
lensing signal to noise ratio and too low \ 2 difference ratio are cut. 

Cut 9 No single over-dominant data point 

Several lightcurves were found which had one or perhaps two data points in 
the fitted bumps with very large difference flux values and small errors, giving 
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them a very highly significant deviation above the average of other points in the 
lightcurve. Since it is not felt to be a good thing to rely on a very few points for 
all the significance of an event (even if sufficient other data points exist to pass 
Cut 6), a check is made that of the data points in the fitted Paczyhski peak in the 
region (t — 2£fwhm) < to < (to + 2£fwhm) the numbers of significant points pass 
the original specifications for the numbers of significant points in a bump, i.e. if 
n p < n pmin or < n hmin in the final fitted bump then a lightcurve is rejected. 
If the numbers of data points within this region are fewer than n pmin then the 
region is widened until the number of data points is equal to that number. This 
means that if there are only n pm i n data points in the bump then all of them must 
be more than 3a above the baseline in order for this cut to be passed. 

Cut 10 Significance of fitted peak 

Since the fitting routine is not currently forced to place the Paczyhski peak within 
the bounds of the most significant bump in the lightcurve found in the bump 
finding section, it is possible that a lower x 2 solution could be found by placing 
to elsewhere, especially when combined with a cosinusoid function. Hence it is 
necessary to check that the points within the fitted peak have at least the same 
significance above baseline as was required to define a bump in the first place. 
The required value of the geometrical mean value of /^/ in the peak region is 
therefore also set as > 8.77. 

Cut 11 Xiocai/d-o.f. of the Paczyriski peak 

All data points within the region (t — 2.0 £fwhm) < t < (t + 2.0 £fwhm) were 
considered to be inside the fitted bump, and, using these points only, a "local" 
xf oca i/d.o.f. parameter was constructed by using the formula 



where xLmp = Y^^i^ _ -F fit ) /error;, n to tai is the total number of points in the 
lightcurve, nb U m P is the number of points inside the bump, and N is the number 



xLai/d-O-f- 



Xbump n total 



(4.9) 



(ntotal — N)n bu mp 
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of fit parameters. This is the same as finding the x 2 /d.o.f. using the contributing 
points in the bump only, then scaling the result up as if the whole lightcurve had 
contributed. The cut on X\ OC ai/d-ol. is set at x? 0C ai/d- -f-< 5- 

Cut 12 No significant coherent residual bumps 

By "significant coherent residual bumps" is meant groupings of consecutive data 
points all showing a significant deviation from the best-fitted model curve. Both 
positive and negative bumps are searched for, since a large positive bump or 
bumps in a lightcurve in addition to that fitted decreases significantly the confi- 
dence in the microlensing model and increases the likelihood that the flux varia- 
tions are due to other repeating variable phenomena. Negative bumps decrease 
the confidence in the variable baseline model fit and hence, by implication, the 
confidence in the lensing component fit, which relies on the subtraction of a satis- 
factory baseline fit. It must be emphasised that this cut is applied to lightcurves 
which have already passed the two x 2 /d.o.f. cuts, both of the global (Cut 5) 
and local (Cut 11) kind, so the fact that it became clear that this cut is required 
demonstrates that relying only on x 2 /d.o.f. to decide if a fit to a model was good 
enough is insufficient for our purposes. If the deviation/s, while clearly significant 
in themselves, are not too large, their excess x 2 ma y be averaged out over the 
rest of lightcurve and the x 2 /d.o.f. may still stay below a reasonable cut level. 
This can be possible if part of the lightcurve is an especially good fit to a part 
of the model (for e.g., the lensing peak) and has a below average x 2 /d.o.f., thus 
leaving more "free" x 2 to be shared out among the other data points, including 
the deviation. Simply put, relying on x 2 /d.o.f. requires that the model used 
explains (and hence removes from the residuals) any non-random flux variations. 
In our case, there exist many different combinations of physical phenomena which 
could produce lightcurves more complex than our most complex model, and hence 
non-random flux excursions can and do remain in the residuals. 

Coherent flux deviations in the residuals are searched for using two variants on the 
routine used to make Cut 1. For positive bumps, as in the original bump finding 
routine, n p is required to be 5 and rih is 2, using the same two significances of 3 
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and 5<j respectively as used in Cut 1. n m , the number of points allowed below 
threshold before a bump ends, is set to be more sensitive than for the original 
bump finding, with n m = 1, as opposed to 2 previously. Negative bumps are 
allowed to be broader with a lower peak significance, since they are not expected 
to look anything like a lensing lightcurve, requiring n p > 5, Uh = 0, n m = 2. In 
addition, groups of data with no points above the threshold are allowed to 
accumulate using more data points, as long as the geometrical mean value of fdif 
is as great as it would have been for the criteria above. 



4.4 Development of the Pipeline 

Checking that no useful data are being thrown away 

The microlensing candidate lightcurves which are most likely to have the most 
extreme flux variations and hence the most extreme departures from their mean 
fluxes are the three lightcurves associated with the position of the "short event" 
described in Chapter |5j which in the 2007 photometry are the three events 4427, 
6389 and 12147. These lightcurves were used to check whether the 10cr flux 
cut caused any of the points from "real" microlensing candidate events to be 
unintentionally thrown away by the sigma clipping routine. For the event 6389, 
the peak flux value magnitude is 137.59 ADU/s whereas the standard deviation of 
the whole lightcurve is 23.78. Thus, 10cr above the mean (which is close to zero) is 
~ 237.8 which is clearly larger than 137.59 with a reasonable margin. One point 
in this lightcurve is clipped, but this is one of the "typically" bad single points in 
the POINT-AGAPE (PA) data described above, and is 16.38a above the mean 
flux for the band. It is therefore known that the 10<r threshold is reasonable both 
on the high and the low side, being roughly between the "normal" deviant flux 
values and the most extreme known "real" flux values. 
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4.4.1 Using the F-Test to decide the most useful model 
fit 

In the early stages of the development of the Candidate Selection Pipeline, the 
model (i.e., Paczyhski only, mixed Paczyhski + Variable, pure Variable, or Con- 
stant) which was chosen to represent the classification of the lightcurve was de- 
cided by choosing the model with the lowest x 2 • However, it was quickly realised 
that since the models compared had differing numbers of degrees of freedom, that 
like was not being compared with like, and that in some cases it might be better 
to choose a simpler model with fewer degrees of freedom, even though it might 
have a slightly higher x 2 ■ The statistical test which can decide whether a model 
with more parameters but a lower y 2 makes a big enough improvement in the y 2 
as compared to a simpler model, to justify the use of the extra fitting parameters, 
is the "F-Test" . 

For the current application, the "F-ratio" which must be calculated in order to 
perform the test can be written as in Equation 14.101 

(x 2 i-x 2 2 ) 

F-Ratio= (P1 7 2) (4.10) 

X 2 
"-total - VI 

(where ntotai = X]fc=Y dSnfc )> to express the F-ratio in terms of the "observables" 
in the selection pipeline, the x 2 ' s of each model, numbers of parameters p% and 
P2, and the number of data points ntotai- 

A confidence level of 0.05, which is the standard level used for F-tests, was used. 
This means that the probability that the more complicated model is NOT better 
is 5%. 

In all cases the model which had been found to have the lowest x 2 fit was only 
compared, using the F-test, to models with lower numbers of parameters than 
itself, as if a model was more complicated, and had a worse x 2 ■, then it would 
definitely be a worse model anyway. 
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If the status of the lowest x 2 model was "downgraded" because the F-test showed 
that it is not the best model to use, then the "classification" of the lightcurve 
was changed to that of the simpler model which displaced it. Then the new "best 
model" was compared in turn with any remaining simpler models to see if it was 
in fact the "simplest and best" of all models fitted. The classification remained 
fixed only if a model "wins the contest" with the next simplest model. 

4.4.2 Extreme flux ratios 

In the version of the code where the flux amplitude ratios between PA and LT i'- 
band data are not fixed, it is sometimes possible for the fitting routines to return 
extreme flux ratios, especially in the "mixed" fits. There are several ways that 
the fitting routine can combine a Paczyhski function "creatively" with a skew 
cosinusoid to slightly reduce the x 2 °f a basically flat or simple periodic variable 
lightcurve. Therefore, extreme fitted flux ratios are not retained as classified 
lensing but downgraded to the classification with next lowest x 2 ■ F° r example, 
in the case of the flux ratio between the amplitudes of the lensing components 
in PA and LT data the status of the candidate is only changed if two conditions 
apply: 

a) The flux amplitude ratio of the fitted Paczyhski components is more than 3 
times what is expected, i.e. more than 3 times the "normal" amplitude ratio 
which is taken as 8.53 for these purposes and b) The to of the fitted Paczyhski 
peak is close to the boundary of the two data sets, which occurs at around ( JD — 
2400000.5) = 53000. For these purposes, "close" is defined as i) within (t + 
£fwhm > 53000) for a t m the PA data and ii) within (to — £fwhm < 53000) for 
a to m the LT data. In this area, there is a chance of some interaction between 
the Paczyhski peak in both bands, whereas if the central peak is only covered 
in one band, the fitted amplitude ratio would be expected to be almost random 
anyway, without this being something to cause undue concern. 
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4.4.3 The Quality Factor of selected events 

As a tool to assist in deciding the final values of the cuts and showing the most in- 
teresting lightcurves to examine further, the second part of the selection pipeline 
was expanded to include a numerical tool which attempted to put the selected 
lightcurves in order of their "quality" with respect to confidence in the microlens- 
ing interpretation of the data. This was done according to the value of a "quality 
factor" constructed from a product of five factors, one for each major cut, quan- 
tifying how "well" (i.e. by how far) the lightcurve had passed that particular 
cut. 

Four of the five factors were those already used as cut parameters and hence 
defined above in Section 14.31 The fifth, called the "bumpsample quality factor" , 
or "BQF", quantifies how well the fitted bump is temporally sampled. 

The value of the "bumpsample quality factor" , "BQF" , is given by the product 
of the mean of the number of data points sampling each of the four quadrants 
defined below with a scaling factor representing the fraction of the total number 
of these four quadrants which contain at least one data point. The four quadrants 
used were four of those used in the definition of the bump sampling cut, i.e. if 

the number of data points within t <t < (t + <FV ^ HM j is nt, 

the number of data points within to > t > (to — tFW ™ ) is n7,and the number of 
data points within (t + *eshm.) < t < (t + 2t FWHM ) is nt 2 , 

the number of data points within (t — tFW „ HM ) > t > (t — 2ipwHM) is n~[ 

2 ' 

and the number of quadrants with at least one data point is n>o 

then the value of the bumpsample quality factor is given in Equation 14.111 



BQF = log [ 




(4.11) 



The reasoning behind this definition was that it was not only the number of data 
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points in the fitted flux peak which was important for modelling the lightcurve, 
but also that they were evenly distributed in time. Therefore even if one lightcurve 
had many data points, but they were all closely clustered in time, this should not 
be valued as highly as a lightcurve with the same or slightly fewer data points 
well spaced over the peak. Also, for modelling microlensing, good data coverage 
of both the central peak and the wings are required. The logarithm was taken to 
reduce the relative weighting of the BQF in the overall quality factor for larger 
values of the BQF, where the data points are so close together compared to the 
timescale of the Paczyhski peak that each extra point adds less to the information 
known about the peak than for lower values. 

The expression for the value of the quality factor "QF" is given in Equation 14.121 
in which the lensing signal to noise is represented by "LSN" the \ 2 difference 
ratio by "CDR" , as previously, and the "bumpsample quality factor" by "BQF" . 

QF = (x 1 2 oca i/d.o.f.)(x|obai/d.o.f.)(LSN)(CDR)(BQF) (4.12) 

4.5 Optimising the levels of the cuts 

It became clear that running the complete candidate selection pipeline each time 
the level of one of the five major cut parameters was changed, to investigate the 
effect of that change, would be an impractical proposition, given the number of 
possible combinations of the levels of these cuts that might need to be inves- 
tigated and the time the selection pipeline took to run. Therefore, a C-Shell 
script was written which mirrored the operation of the main cuts of the first 
part of the pipeline precisely, producing the same selected events for any level 
of cut. This was achieved by ensuring that the calculated values of any param- 
eters that were used to cut on were printed to file by the first ("main") part 
of the selection pipeline, along with the coded description of how the lightcurve 
had been classified. When the main pipeline was run, values of the cuts were 
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chosen which were very "relaxed" (in other words, they allowed through more 
lightcurves as microlensing candidates than could be conceivable for the final cut 
levels). However, the cut levels were chosen so that the events selected would 
still show a reasonable correspondence with the desired microlensing form, even 
if the list produced contained candidates that were of lower quality than would be 
expected from those in the final selection. The cuts were chosen to be relaxed for 
two reasons; firstly so that many examples of the kinds of lightcurves which are 
being considered were produced, to allow errors in the selection procedure to be 
more easily spotted and diagnosed, and secondly to ensure that the most detailed 
analysis required to produce the calculated values of the cut parameters (which 
are written to file and later read by the second stage pipeline script) is performed 
on a larger number of lightcurves that are likely to be selected when the cuts are 
later tightened. This in turn ensures that the second component of the pipeline 
always has the necessary data to work with. If the cuts had been chosen to be 
too tight for the main program run, some parts of the program might never be 
accessed because they are "inside" a cut condition which was failed, and hence 
the data required for the script to reproduce the results would never be calculated 
or written out. The initial values for the cuts which were chosen for this "general 
cut parameter calculation" run were as follows: global x 2 /d.o.f. = local x 2 /d.o.f. 
= 15.0, CDR = 0.1, CDR (for "long timescale events") = 0.01, "lensing ratio" = 
1.5. The global x 2 /d.o.f. and local xVd.o.f. cut values were set initially at 15 
as a value above which it was not believed any real confidence could be placed 
in the fits, since generally fits to data having values of x 2 /d.o.f.> 10 are not 
considered reliable. The value of CDR must be a positive number, and when set 
at 0.1 represents a 10% improvement in x 2 /d.o.f. by using the best fitting model 
used. This value was chosen to at a level which was lower than that at which it 
was anticipated the final cut would be set. Lensing ratio must also be a positive 
number and from theory should have had a value greater than at least 2 since 
this would represent approximately a bump which was on average 2a above the 
background flux level, so this cut level was set lower than this at 1.5 to allow it 
to be progressively tightened across the interesting range. All of the above cuts 
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were initially set at levels which it was felt would be slightly too lax to be used 
in the final selection, for the same reason. 

To get a feel for the limiting practical values of the cut parameters for the data 
set being used, the second stage pipeline script was run for 35 different sets of cut 
parameters. In each case all except one parameter were given the starting values 
above, and the last was varied steadily in the direction required to produce a 
stricter cut until no lightcurves were allowed through. The required range of the 
parameter being varied was found by experimentation; the cut level was made 
progressively more tight until no lightcurves remained selected. Several runs of 
the program were required just to discover the practical limit of each parameter. 

For the run of the pipeline (using the third season (2007 photometry) data, and 
the version of the pipeline which does NOT link the magnitudes of PA and (LT 
or FTN) flux amplitudes) the parameter values used and the resulting numbers 
of lightcurves selected, both in the "normal" and "long timescale" categories, are 
shown in Table H~2l below. Clearly, at least while the pipeline was being developed 
and checked, the values of the cut parameters each had to be less strict than the 
limiting values which did not allow any lightcurves through, as when used in 
combination, all the cuts together with their real values allow through fewer 
lightcurves than only having one "tight" cut as in Table 14.21 with all the others 
at their most relaxed base values. 

At this stage of the candidate selection investigation, the cut levels used by the 
pipeline emulator script were made significantly stricter, to make the selected 
lightcurves more likely to be good candidates. The cuts used at this stage 
were: global x 2 /d.o.f. = local xVd.o.f. = 10.0, CDR = 0.1, CDR (for "long 
timescale events") = 0.1, "lensing ratio" = 2.0. Using these cut levels, 62 "nor- 
mal" lightcurves were selected, along with 42 in the "long timescale" category. 
A quality factor was also calculated for the long timescale lightcurves in an anal- 
ogous way to the normal ones with the exception that there could only be four 
factors multiplied together as the lensing signal to noise factor is undefined for 
the long timescale class, by definition. 
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Table 4.2: Table showing the number of lightcurves passed by the selection 
pipeline for varying values of the main cut parameters. 
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It is interesting to note that the numerical values for the quality factor were of the 
same order of magnitude for the majority of lightcurves in both groups, despite 
the long timescale group having lost a multiplicative factor (although only usually 
of order 1) from their quality factor calculation. This was apparently due to the 
quality factor associated with the data sampling of the bump being on average 
larger for the long timescale group, as flux deviations which are longer in duration 
will on average contain more data points. Only two lightcurves (numbers 1033 
and 2675) at this stage were selected on the second iteration. 



4.5.1 Data sampling of the Paczyhski peak 

A cut was introduced to ensure that microlensing candidates had good data 
sampling of the area around the peak of the lightcurve in the region of to- The 
fitting routines have a tendency to insert Paczyhski peaks into gaps in the time 
coverage of the data, in order to reduce the overall \ 2 - This will usually happen 
when one or more data points on either side of the gap in the data are significantly 
higher in flux than those immediately outside of them with respect to the to °f 
the peak. This is usually a caused by random variations in the data. In a small 
number of instances, this kind of flux variation may actually be caused by a 
true microlensing event which just happens to unfortunately fall in a gap in the 
data. There is nothing that can be done in this case to distinguish whether the 
changes in flux are real or random and so it cannot be established whether a 
real microlensing event is occurring. Therefore, a good data coverage of the peak 
must be demanded to increase the confidence that the modelled event could be 
real. 

In the first instance, a routine was written which counted the number of data 
points within the ranges t ± £fwhm and t ± ' F ™ M . The routine also calculated 
the normalised distances At normjn = t p~^' M at which the nth data point exists, 
where n was calculated in the range 1 to 10. This was done to enable cuts over 
different time ranges than just t ±t FWH M or t ± f ™ HM to be used. For example, 
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if it was required that at least 5 data points exist within i =t 3£fwhm then the 
cut would be "At normj5 < 3" 

The minimum number of points which were required to exist in this time range 
was specified as a cut; initially, 3 points within to ± ^fwhivi was chosen. 

This worked reasonably well, but one example in particular forced a reassessment 
of whether this was an over-simple way to perform this cut. In one case, which 
was numbered lightcurve (LC) 1785, which happened also to be one of the few 
lightcurves in which some evidence of microlensing-like behaviour was found on 
the second iteration of the pipeline after the subtraction of variable behaviour on 
the first iteration, (i.e. found using the "/<&/ clustering" routine), the lightcurve 
was selected (not rejected) by the pipeline, and a x 2 /d.o.f.= 8.05 fit was found to 
the data using a Paczyhski fit. However, on examination of the fits, it was clear 
that this event, although a good fit to a very-short-timescale Paczyhski peak of 
^fwhm = 6.41 days was much more likely to be a classical nova candidate (from 
the very steep rise in flux). This lightcurve is shown in Figure POjI 

The reason why doubt exists about the explanation for this lightcurve is that the 
data coverage ends very close to the modelled t and data only exist on the low-t 
side of the event. This event demonstrated that if data only exist on one side of 
the flux peak it can be difficult or even impossible to characterise the lightcurve 
as either microlensing or, for example, a nova. 

With this event in mind, therefore, the cut on data coverage across the peak was 
changed so that a given number of points must exist on both sides of the peak, 
within symmetrical time ranges, as described in Section 14.31 above. The time 
regions (t Q + '™ M ) > t < t + 2tpwHM) and its equivalent on the negative side of 
to were chosen because these are the approximate regions in which a microlensing 
curve differs the most from a classical nova lightcurve; where the flux is not at 
baseline but not yet at peak. This is especially true for the flux rise, where 
novae rise much more steeply than a lensing event of the equivalent overall best 
fit timescale. It was also specified that data should exist close to the time of 
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modelled peak flux, as this region is very important in modelling the magnitude 
of the maximum flux rise from baseline. Hence at least one data point was also 
required within the range (t - ^™.) > t < t + fessm). 
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Figure 4.6: Two plots of Lightcurve 1785 in the 2007 photometry, a) The whole 
lightcurve b) The peak region (within t ±2tFWHM) showing more clearly the time 
sampling of the flux peak, insufficient to pass the selection cut. 



4.5.2 Long-period variation in the baseline of mixed mi- 
crolensing candidates 

At one point in the development process of the candidate selection pipeline, af- 
ter the quality factor classification had been introduced (see Section l4.4.3p . the 
lightcurves with the highest quality factor were being examined to ensure that 
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only lightcurves that were convincing candidates had been selected. It was noted 
that lightcurve 6149 had the second highest value of quality factor at that time. 
When the lightcurve was examined, however, although all the fits (shown in the 
first panel of Figure 14 .7p were correct, and the fit to the microlensing compo- 
nent (shown in the second panel of Figure 14.71) seemed to be exactly as would 
be expected for a lightcurve with a high quality factor, the overall lightcurve 
seemed insufficiently convincing to rate it as the second highest quality factor. 
After some thought about the possible reasons for this, it was decided that the 
amplitude of the microlensing component was not much larger than that of that 
of the variable component, coupled with a second factor that the period of the 
variable component was long compared with the time-span of the data. Only LT 
data were available for this object, giving a timespan of only 3 seasons, and the 
fitted period of the variable was 966 days; comparable to the span of the data. 
This meant that less than a whole period of the variable component could be 
within the data. This in turn meant that, although the joint skew cosinusoid + 
Paczyhski fit does fit the data well, this interpretation was not guaranteed to be 
the correct one since the behaviour of the variable part had not been properly es- 
tablished. An alternative interpretation might have been that there could be one 
or several variable star(s) which happened to brighten just when the Paczyhski 
peak occurred. Therefore, since confidence in the appropriateness of the joint fit 
depends to a large extent on the believability of the fit to the baseline, it was 
decided that this sort of lightcurve should not be allowed in the final selection, 
although with a longer time series of data the behaviour of the baseline could be- 
come better established, allowing the fitted Paczyhski peak to become a selected 
microlensing candidate. 

By analogy with previous works which demanded microlensing candidates have 
a flat baseline outside the Paczyhski peak, it was therefore demanded that the 
regular periodic nature of the modelled "baseline" variability be firmly established 
before the exceptional nature of the Paczyhski peak could be confidently claimed. 
To this end, a cut was devised that had two conditions either of which had to be 



4.5. Optimising the levels of the cuts 



118 



passed if the lightcurve was to get through. They were: 

1) The period of the variable component must be less than half the time span of 
the data (i.e. at least two periods must be contained in the data) OR 

2) The peak magnitude of the Paczyhski peak must be > 5 times the amplitude 
of the sinusoid. 

After this cut was first applied, the number of joint (mixed) fit lightcurves which 
passed through the main pipeline decreased from 213 to 169. The above two 
conditions were eventually modified to form Cut 4. 
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Figure 4.7: Two plots of lightcurve 6149 in the 2007 photometry: a) With its 
joint skew cosinusoid + Paczyhski fit, showing the long period variation in the 
baseline with comparable amplitude to that of the Paczyhski peak, b) With the 
variable component of the fit subtracted, plotted with the Paczyhski part of the 
joint fit. 
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4.6 Investigations of variable lightcurves and tele- 
scope flux ratios 

As part of the process of developing the pipeline and understanding of the data, 
several investigations have been carried out into lightcurves which were classi- 
fied by the selection pipeline as "variable". These initially included any kind of 
variability, not only periodic variables. At the start of these investigations, the 
lightcurves being worked with were from an earlier DIA run which had used a 
different reference image to the one used to create the data on which much of this 
thesis is based. The reference image relevant to the data used in this thesis was 
chosen, and the DIA pipeline re-run, in August 2007. Therefore all lightcurve 
numbers applicable at earlier times than this would not correspond to those after 
this date and so this relatively early stage of investigation will be reported in 
general terms without specific lightcurve examples. The selection pipeline used 
to classify the variables was also at a much earlier stage of development, and so if 
the experiments performed were repeated using the current pipeline, the results 
would differ. It is not thought, though, that the differences would change any 
of the conclusions reported below, because the routines used to fit the variable 
function, and the branch of the pipeline to which variable-type lightcurves were 
directed were not significantly changed. Most of the development work had been 
concentrated on the lensing branch. 

The selection pipeline was run with the following cut parameters: 

Only LT i'-band and FTN i'-band fluxes (or LT r-band and FTN R-band if used) 
were linked in magnitude: x 2 /d-o.f. = 10, x 2 ratio= 0.3, bump data sampling = 3 
points within t i ^fwhm • This produced 326 lightcurves selected as micro lensing 
candidates. 

Next the pipeline was run again, with tighter cut parameters: x 2 /d-o.f. = 10, x 2 
ratio= 0.5, bump data sampling = 3 points within t ± *™ HM . This selected 143 
lightcurves as microlensing candidates. 
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The skewness limits were set so as to preserve a simple monotonic rise and fall of 
the function and avoid the more complicated behaviour that emerges for higher 
skewness values. The limiting value was chosen by finding two lightcurves, one 
of which appeared to be near the borderline of acceptability, but on the right 
side of the border, and another which was slightly too skew to be acceptable. 
The values of skewness of these two lightcurves were 1.574 and 3.151 respectively. 
Therefore the skewness parameter was limited from this point onwards at 1.574. 
This investigation was performed using only skewness parameters S > 1 (see 
Section 14.2. 10p as at the time this was the only range being used. 



4.6.1 PA-LT/FTN flux amplitude ratios 

Previous to this point, the only knowledge that had been gained of the magnitude 
of the ratio between the PA flux measurements and those of LT and hence FTN 
had been from the exploration of the short event (see Chapter [5]), where, over a 
long baseline, good evidence of at least one variable star was found in the baseline, 
allowing the fitting of the flux ratio between the PA i' band variable flux amplitude 
and the FTN i' band flux amplitude as equal to 8.53, making the PA/LT ratio 
= 8.53/1.081 = 7.89. It was felt that relying on this one example and assuming 
that the same ratios applied for all lightcurves was not safe, and so efforts were 
made to find out with greater certainty what the relationships between flux and 
flux ratios actually were. This aim was of high importance as the ability to fix the 
flux ratio between two bands has several benefits. One fewer fitting parameter 
would be required, reducing the time spent by the fitting routine and reducing the 
mult i- dimensional parameter space by one dimension, which makes a considerable 
difference to how easily the lowest \ 2 solutions may be found. Also, since the 
time indices of data in the LT and FTN data do not overlap with those in the 
PA data, the only way the flux amplitudes in different telescopes or wavebands 
can be related is by using some long-term, repeating phenomenon which can be 
detected in both telescopes (such as variable stars). However, a microlensing 
event will only rarely be of sufficiently long timescale and have to close enough 
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to the boundary between the data groups to have enough of a detectable change 
in flux that it can be accurately characterised in both bands. Usually, data 
around the peak would be required to know the microlensing flux amplitude, 
which almost always will occur in only one band. Therefore, if knowledge of the 
average relationships between telescopes/wavebands is not available, then the 
peak fluxes of microlensing events must be fitted individually with independent 
fitting parameters. Because only the amplitude of the fit in the band containing 
t can usually be believed, this makes appropriately scaling data to be able to 
display all of the data points from all bands on one time axis very complicated, 
especially in the case where a mixed fit has been applied, and the lensing peak 
has a similar amplitude to the variable amplitude. 

For example, a microlensing candidate with to in the PA data might be fitted 
in LT i'-band with a very large amplitude (much larger than would be expected 
from the average expected flux ratio between PA and LT). The reason for this 
might have been that the data at the beginning of the LT lightcurve appear to 
have some random positive trend. 

In this case, simply subtracting this component to leave the variable component 
does not give the correct best fitting variable component as the two components 
were fitted jointly, each with "knowledge" of the existence of the other. Hence 
only one constant flux offset parameter would have been used, which in this case 
would be large and negative, to almost cancel out the magnitude of the large 
lensing peak, to just leave the small positive random swing visible in the LT 
data. 

Reducing the incidence of these "extreme flux ratio" fits was the motivation 
behind limiting the acceptable range of flux ratios for mixed fits, and regrading 
those with extreme flux ratio values to simply be variable star candidates. Except 
when it is known that the telescopes and filters are truly identical, ideally it 
should not not be automatically assumed that the flux ratios that should be used 
for the variable component and for the Paczyhski component of a mixed fit are 
identical. Although the PA i'-band and LT/FTN i'-band filters should be very 
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close, they will not be identical, and since the source star for a microlensing event 
will almost always not be the same star as the variable component, therefore 
usually having different colours and hence flux ratios, there may still be small 
differences in the flux ratios for microlensing components and for variable star 
components. Moreover, although the flux ratio between wavebands is expected 
to be constant for microlensing events as these are achromatic, in general the flux 
ratio for variable stars would be expected to change slightly, as the temperature 
of the star varies with time as it expands and contracts. Since both bands used 
here are i'-band, in this case these differences will be negligible. For the purposes 
of applying any consistent flux ratio relationship that might be found, it was 
necessary to assume that the same ratio be applied to both components, because, 
as described above, Angstrom does not have any data which overlap with the 
PA to allow the flux ratios between PA data and other bands, for microlensing 
components, to be investigated individually, or overall, in a more direct fashion. 

Therefore, since ideally, if a simple relationship did exist between the fluxes in the 
PA and LT/FTN data it would be useful to know it, to avoid all the complications 
above and make the fitting faster and more accurate. This was attempted by 
examining fits to lightcurves with purely periodic variability i.e. no suspected 
lensing components. The flux ratios in this section were all calculated and are 
quoted as if they were between PA i' and FTN i' bands, which is because the 
scripts were written when three data bands were being used. 

This investigation was begun by selecting lightcurves from the previous run of 
the pipeline where % 2 /d.o.f. = 10, x 2 ratio= 0.5, bump data sampling = 3 points 
within ± ' F ™ , in other words, "good variable candidates". This selected 341 
lightcurves. 

A further selection was made from this list of lightcurves which might be expected 
to have the best chance of having a well defined flux ratio between PA i'-band and 
either LT and FTN i'-band fluxes, based on the number of points in each band. 
These criteria were later improved upon and described in more detail below. 
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In an attempt to further increase the sample, the original lightcurve plots were 
re-examined, looking for "mixed" events which had clearly variable baselines. 13 
were selected as suitable and these were added to the original 341 lightcurves. 

18 lightcurves were selected with sufficient data and non-zero fitting errors. The 
calculated mean flux ratio of these 18 lightcurves was 8.53, with a standard 
deviation of 5.69. The contribution to the overall uncertainty in the mean from 
the fitting errors was only l^i? an d so negligible compared to the scatter. 

Although the magnitude of the amplitude seemed reassuringly close to what was 
expected from investigations of the short event, the large spread was worrying. 
Closer examination by eye of the selected lightcurves showed that on the whole 
they were low signal to noise variables. This could be explained because variables 
with large signal to noise would often have other detectable variations in addition 
to the simple sinusoid modelled by the fitting code and so would fail the x 2 /d.o.f. 
cut. 

Therefore, a method seemed to be required to classify variable lightcurves accord- 
ing to the "amount" of variability present. In other words, some kind of of variable 
signal to noise quantifier was required so that the more "interesting" variables 
could be preferentially selected and examined more closely without having to go 
through all of the thousands of lightcurves which were classified as "variable" . 

One method which would have been possible to implement would have been to 
take the amplitude of the best fitting sinusoid curve and to compare this with 
a measure of scatter, such as the standard deviation, of data points about this 
curve. A lightcurve which is purely noise would have a large scatter but the 
amplitude of the sinusoid would be very low, whereas a lightcurve which is a very 
good fit to a sinusoid curve would have a finite sinusoid amplitude but a low 
standard deviation around that curve. This method was not chosen for several 
reasons. Firstly, through examining a random sample of lightcurves classified as 
"variable" , it had been noticed that a few of them had extremely large, apparently 
random flux variations, but were not well fitted by a sinusoid curve. 
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In these cases the scatter is huge, but the amplitude is very low, so they would not 
be picked out by the method described, and in fact would be actively "suppressed" 
in their variability classification. These lightcurves are interesting to know about, 
so that they can be either removed from the list and/or investigated, and so 
incorrectly classifying their variability would have been undesirable. Secondly, the 
method relies on the assumption that all variable stars that might be interesting 
are a good fit to the chosen sinusoid model, because otherwise a lightcurve might 
be a very interesting periodic variable which just happens not to match a simple 
sinusoid. This would make the scatter from the sinusoid large (but not random; 
rather, coherently clustered in time), and thereby reduce the significance of the 
signal to noise classification. 

The method which was preferred made a comparison of the scatter of data points 
in the lightcurve from the mean flux with the mean error over the whole lightcurve. 
(The mathematical form of this quantity is shown above in Equation 14. 8p . This 
had the advantage that it should pick out the large signal to noise random vari- 
ables, but also works for regular periodic variables. 

In addition, a cut was made on the fraction of data points above a given multiple of 
the mean error away from the mean flux. Initially, a multiple of 10 times the mean 
error, with 20% of points being above this level was demanded. This resulted in 
21 lightcurves being selected. This was tested by running the main pipeline for 
this list of lightcurves and examining the plots. 18 of the 21 lightcurves were 
classified as variables and 3 as some kind of microlensing candidate, either mixed 
or pure Paczyhski. The method seemed to work well and so it was incorporated 
into the main selection pipeline. 

The number of points in both PA and the total of all LT and FTN data present 
were required to have a minimum number of data points to ensure that each band 
would have sufficient points to properly define the flux amplitude in that band 

1) total number of points > 250 2) PA points > 80 3) total of other bands > 80 

An additional constraint was introduced that the MINUIT fitting errors were not 
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allowed to be zero (which is often an indication that a fit has not fully converged). 

The two variability cuts described above were also applied, but with slightly 
relaxed levels of 

a) Fraction of points above n(E) away from the mean = 0.15 b) where n = 7.5 
and E is the mean error on the data points. 

With these cuts, 62 lightcurves were selected: These were examined by eye, se- 
lecting those which had a reasonably good fit to the skew cosinusoid function and 
in which variations can clearly be seen in both PA AND in LT or FTN. 

After this step, 33 lightcurves remained. When lightcurves with zero fitting errors 
were rejected as before, only 10 remained. 

The mean flux ratio obtained from these lightcurves was 25.72, and the standard 
deviation was 13.77 with fitting errors 1q!i5- Clearly this is not very compatible 
at ±1(7 with the earlier result. The largest value was 74.4, from LC 2004 and the 
smallest value was 6.89 from LC 10685 - a very large range indeed. The plots of 
the model fits to these two extreme ratio lightcurves were examined by eye and 
were reasonable fits to the data and consistent with the above ratios. 

The next experiment tried was to attempt to use larger numbers of lightcurves 
(better statistics) to see visually whether any correlation existed between, for 
example, PA flux amplitude and flux amplitude ratio. Clearly, if the ratio was a 
constant, this correlation would be a horizontal straight line at the value of the 
ratio. Any non-constant correlation would imply a more complicated relationship 
was present. The classifications given to lightcurves by the selection pipeline were 
used to select lightcurves that were classified as variables. Instead of requiring a 
given value of x 2 /d.o.f. and a low y 2 fit to a skewsinusoid fit, however, neither of 
these conditions were applied. This allowed 11341 lightcurves to get through the 
first stage. These consisted purely of lightcurves for which a variable fit was the 
lowest x 2 of the ones fitted. 5818 of these passed the above cuts on the number 
of points in each band, and 1185 of these had non-zero fitting errors. 
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The flux ratios of these 1185 lightcurves were plotted in a scatter plot with their 
PA variable flux amplitude on the x-axis. The resulting plot is shown in Figure 



14781 below. 
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Figure 4.8: Scatter plot of PA variable flux amplitude versus flux amplitude ratio 
for 1185 variable lightcurves. 



It could clearly be seen in this figure that there was some kind of correlation 
between PA flux amplitude and flux ratio which was not described by a constant 
function. However, the correlation was very wide, and had much scatter, which 
was not really a surprise considering the deliberately relaxed criteria used to select 
lightcurves. It appeared that the correlation passed through, or close to, (0, 0) 
and was possibly approximately linear. The densest part of the main mass of 
points was indeed around the 5 — 15 range that had been repeatedly occurring in 
the above experiments. The wide range of values obtained so far was consistent 
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with the wide range of the main mass of points on the plot. 

At a later date the issue of flux ratios was returned to, in an attempt to gain 
a better understanding of these issues. In order to be more certain whether the 
wide spread in flux ratios found in the first two investigations above was due 
to the method preferentially selecting low signal to noise lightcurves, a similar 
investigation was carried out, but using an additional cut on the variable signal 
to noise parameter (as defined in Equation 14. 8p of S vauT > 2. This resulted in 80 
lightcurves being selected, which was reduced to only 5 when the cuts above on 
the number of points in bands and non-zero fitting errors had been applied. The 
mean flux ratio from these 5 lightcurves was found to be 9.14 with a standard 
deviation of 3.31 and fitting errors ~to%4 

The process was repeated using as a base group the list of 355 lightcurves with 
any value of variable signal to noise parameter, i.e. not necessarily S var > 2. 
The value for the ratio now obtained was 10.42 with standard deviation 5.74 and 
combined fitting errors ]to26- 

The several investigations conducted so far, although not seeming to converge on 
one value, did seem to be consistently indicating a value for the flux ratio which 
was < 20. 

The above method was repeated later, after small changes had been made to 
the pipeline. This time, 6 lightcurves were selected. However, only 3 of them 
coincided with the previous 5 above. The discrepancies were all checked and 
were due to the parameter changes in the pipeline. The flux ratio from these 
6 was found to be 13.12 with standard deviation 7.67 and fitting errors loia- 
The three overlapping lightcurves had identical fit parameters, so it was possible 
simply to combine the lists of 5 and 6 lightcurves into one list of 8. The fits to 
these lightcurves were checked by eye. Mostly they were reasonable, but it was 
clear that their larger values of variable signal to noise parameter were mostly 
caused by possessing one or a few very discrepant data points, rather than the 
whole lightcurve having a larger signal to noise overall. 
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The possibility of the existence of a coherent spatial variation in the flux ratio 
between LT and PA data was checked for by plotting points colour-graded by 
amplitude in an X-Y scatter plot of the 1185 data points earlier selected for 
Figure 14.81 above. 

Scatterplot of Periodic Variable LC positions, colour — coded by flux ratio 
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Figure 4.9: Scatter plot of the spatial distribution of the 1185 lightcurves previ- 
ously selected for Figure 14. 8[ colour coded by the flux ratio between the PA and 
LT variable flux amplitudes. The flux ratios corresponding to the colour bands 
are: Blue: 0-5, Green: 5-10, Yellow: 10-15 and Red: 15 and over. 

The overlap regions between the two PA image fields which overlap with the LT 
and FTN fields were clearly delineated in Figure |4T9| but no pattern or correlation 
could be seen in the colours. This strongly implied that a spatial correlation was 
unlikely to be found. Nevertheless, the equivalent plot was also made using the 
eight lightcurves selected above. No spatial pattern could be seen in the low 
number of data points. 
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Instead of selecting and classifying lightcurves based upon the signal to noise 
parameter, which after all was known to be sensitive to non-periodic variations 
and discrepant points, it was wondered whether a better selection criterion might 
be the PA flux amplitude itself, which should perhaps be more closely correlated 
with the "true" variables. 

By this stage, only two bands of data were being used for the fitting; LT i'-band 
and PA i'-band. The script used to select variable lightcurves was modified so 
that it first selected lightcurves classified as variable that had x 2 /d.o.f. < 15, and 
then applied the cuts on the number of data points 

1) total number of points > 250 2) PA points > 80 

Next the lightcurves were grouped according to PA flux amplitude allowing the 
possibility that the fitting errors might be zero. The flux ratio was again calcu- 
lated and is quoted as if it were between PA and FTN bands, which is because 
the scripts were written when three data bands were being used. The categories 
were defined as int( PA flux i ^ n P litude ) , e.g. category contained PA flux amplitudes 
— 10, category 1 contained PA flux amplitudes 10 — 20, etc. On inspection of the 
flux ratio and amplitude information, 5 lightcurves were discarded because some 
of the data were zero, implying a bad convergence of the fitting. The numbers of 
lightcurves found to be in each amplitude category are displayed in Table 14.31 

It was interesting to note that the numbers of lightcurves in each category rose 
with falling category number, until about category 1, when they began to fall. It 
would be expected that fainter stars would always be more common than brighter 
stars, at least down to the detection limit, so it appeared from this table that 
the decline in numbers of variable candidates with lower amplitude variability 
could best be explained by the lower signal to noise categories being too faint to 
be identified consistently. Since the flux ratio between PA and LT was known 
to be of order 10, any lightcurves classified as having PA flux amplitude < 10 
will have LT flux amplitude ~ 1 ADU/s, which is certainly extremely hard to 
detect, given the known noisy LT data. Next, the mean flux ratio of all categories 



4.6. Investigations of variable lightcurves and telescope flux ratios 



130 



Table 4.3: Table showing number of variable lightcurves found to be in each PA 
flux amplitude bin, where each bin is of width 10 ADU/s. 



PA flux amplitude 


number of 


Category 


lightcurves 
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containing more than 1 lightcurve was calculated, along with the standard error 
on the mean, given by the standard deviation about the mean divided by the 
square root of the number of lightcurves. Due to the low statistics, the errors on 
the mean were large in most cases, but for almost all data points the error was 
less than the value of the mean. In addition, in an attempt to improve the flux 
ratio errors (at the expense of the PA flux errors), categories 10-13 were grouped 
together, and categories 14-21 were grouped together, the two groups therefore 
containing 11 and 8 lightcurves, respectively. The mean PA flux values of each 
data point were calculated using the actual values, and in an analogous way to 
the flux ratio, the standard errors on the mean were calculated. It seemed clear 
that these would be under-estimates of the true flux spread, since each data point 
comes from a pre-selected non-random group with a range limited to 10 ADU/s. 
The now 19 data points were plotted in a PA flux vs mean flux ratio scatter plot 
(comparable with Figure |4TH|) . These are the data shown in Figure I4TTU1 

It had been observed that the estimated error in the PA flux direction of the last 
data point (the 21st flux category) was much smaller than for any other category. 
Indeed this was found to be because the PA flux amplitudes for the two specific 
lightcurves involved (5289 and 7081) were extremely similar, although their flux 
ratios were very different. On checking the positions on the sky of these two 
lightcurves, it was found that their positions on the LT field were both close to 
(—17,57). Close examination of the two lightcurves showed that the PA data 
were practically identical, whereas the LT data were different, although similar 
in form, implying that two different (but adjacent) LT objects had been matched 
with the same PA object at different times. This introduced some doubt over the 
estimate of the errors in both directions, as the two lightcurves are only semi- 
independent data points, although the magnitude of the flux ratios derived still 
look sensible on average. To be safe, this data point was not included in the 
following work. 

A clear correlation can be seen by eye in Figure 14.10} although finding a line 
of best fit which satisfactorily describes the data was made harder by the way 
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all the points with small flux ratio spreads are clustered at the bottom left of 
the plot. This drags any fit which includes the errors downwards, so that the 
majority of the data are above the trend line. In Figure I4.10[ the various fits 
that were performed are shown. In summary, the blue lines were found by linear 
regression based on the unweighted mean (assuming throughout that all errors 
are equal). The red lines were again found by linear regression, only again forced 
to go through the mean weighted by the flux ratio errors. The third set of lines, 
which are green, show the best fit lines obtained by a full x 2 minimisation fit, 
using the flux ratio errors only. There are always two lines of each colour because 
the calculations were performed both with and without the first (Category 0) 
data point, which also has some doubt attached to it, for the reasons described 
above. As would be expected, not including this first point allows the gradient of 
the lines to move away from where the data point once was on the left hand side, 
meaning that the fits performed without the first data point have lower gradients 
and higher offsets. 

Possible fitting functions other than simple straight lines were considered, for 
example a logarithmic curve, but it was felt that not including the first point 
removed any motivation for trying this as it was the point which had the flux 
ratio which was the lowest relative to the others. 

It was felt that the range of the six curves described above nicely bracketed the 
range of fit lines which might be described as "possible" . The seventh line, which 
is coloured black, has the mean gradient and offset of the previous six, that is: 
Gradient = 0.08038, offset = 5.98 

The equivalent parameters for the x 2 minimisation fit are: 

1) Including Category point: 

Gradient = 0.087 ± 0.017 offset = 3.86 ± 0.56 
X 2 = 59.54, rVd-o.f. = 3.72 

2) Not including Category point: 
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Gradient = 0.046 ± 0.010 offset = 6.4 ± 1.0 
x 2 = 50.90, x 2 /d.o.f. = 3.39 

It is not clear whether the underlying behaviour of the flux ratio with PA flux does 
flatten out as PA flux increases beyond 200 ADU/s, but all proposed linear fits 
stay within the la flux ratio error of the final unused data point, which provides 
some confidence, even though this data point is not a perfect fit, that continuing 
the linear fit is not wildly wrong. 

However, this question must remain unanswered for the moment, as variables 
with larger flux ratios (and good fits to a skew cosinusoid) apparently do not 
exist in the data, and other kinds of variables are not currently detected by the 
selection pipeline. 
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Fitted PA variable flux amplitude vs PA/FTN flux amplitude ratio 




Figure 4.10: Scatter plot showing the mean PA/FTN flux amplitude ratios found 
for each PA flux amplitude bin, together with the various linear fit lines calculated. 
The two blue lines were found by linear regression based on the unweighted mean 
(assuming throughout that all errors are equal). The red lines were also found by 
linear regression, but forced to go through the mean weighted by the flux ratio 
errors. The third, green, set of lines show the best fit lines obtained by a full x 2 
minimisation fit, using the flux ratio errors only. There are always two lines of 
each colour because the calculations were performed both with and without the 
first (Category 0) data point. 



Chapter 5 



First Results and Analysis of the 
Angstrom Project Dataset 

5.1 Introduction 

In this Chapter the main results of the research described in this Thesis are de- 
scribed. The results of investigations into one particularly interesting transient 
event, of high signal to noise and short timescale are presented. Some lightcurves 
which are highlighted by the pipeline as having a relatively high signal to noise 
bump, but which do not fit the point-source point-lens lightcurve model well 
enough are shown. These may include classical novae. Investigations into the 
properties of variable star candidates are described, and the information about 
their spatial distribution that has so far been extracted is also described. The 
lensing candidates selected by the pipeline are shown, along with summaries of 
their properties. Finally, a test of the pipeline, conducted using "fake" microlens- 
ing event lightcurves, is described. 
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5.2 Comparisons with previous surveys 



Since we use the POINT-AGAPE data to extend the Angstrom survey baseline 
it is interesting to ask questions like: "Do any of the published POINT-AGAPE 
events lie in the Angstrom field?" and if so, 'Are they detected as variable ob- 
jects independently by Angstrom?". If any of these microlensing candidates had 
either repeated, or had showed some clear high signal to noise variation since 
the end of the POINT-AGAPE survey, this would be very interesting, as the 
first possibility would either most likely eliminate the event as a possible lens- 
ing event, or be an exc eedingly rare repeating lensing e vent (possibly a bina ry 



event as investigated bv lJaroszvhski and Skowronl (120081 ) 



Skowron et al. 



(|2Q09|)), 



and the second possibility would cast some doubt on w hether the original event 



was d u e to microlensing. The PO 



(J2003|) 



Belokurov et al 



d2005|) and 



NT-AGAPE papers 



Calchi Novati et al. 



Paulin-Henriksson et al. 



(J2005|) have been checked 



and the positions of the lensing candidates reported therein have been converted 
to arcseconds and compared with the limits of the Angstrom field illustrated by 
Figure 13.21 The POINT- AGAPE field is considerably larger than the Angstrom 
field, and none of their events have positions within it. 

Hence, the LT or FTN data have no power to confirm (or otherwise) the possible 
nature of the POINT-AGAPE events - however it is possible that the BOAO 
or Maidanak data, with their larger fields or view, may be able to assist in this 
quest. However, from the point of view of the Angstrom Candidate Selection 
Pipeline, object detection is only performed in the areas overlapping with the 
LT and FTN reference images, so, none of our events can correspond to these 
previously detected events. Obviously, this means that the selection pipeline is 
not able to re-detect these events. 
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5.3 The very short microlensing event ANG- 
06B-M31-01 



5.3.1 The Angstrom Project Alert System (APAS) 



The event described in this paper wa s detected in t 



Angstrom Project Alert System (APAS) (IDarnley et ai- 



re firs t instance by the 



2007al ). This is a pipeline 



which analyses the data in "real time" (see Sect ion 13.21). I t is an alogous to 
those used by Mil ky Way searches such as OGLE f jSumi et all 120061 ) and MOA 



(jSumi et al 



2003J). Each day, the APAS presents to the observer a limited list of 



typically no more than a few dozen lightcurves, via a web interface. These are the 
most interesting transient signals found in the database of lightcurves produced 
by the difference imaging. Currently this database contains of the order of 90000 
lightcurves and during the observing season is updated each day. The observer 
is then able to review the list and decide which, if any, are worthy of further 
study. In order to allow the assessment of possible lightcurve contamination from 
close-by objects, the APAS also records the positions of all variable signals in the 
immediate neighbourhood of the lightcurves in the list. 



5.3.2 Data and Analysis 



Data 



In addition to almost 3 seasons of LT i'-band and one season of each of LT 
r'-band, FTN i'-band and FTN R-band Angstrom data for the position of this 
particular lightcurve, archive i'-b and data from the POINT-AG APE dark mat- 



ter microlensing survey of M31 (IPaulin-Henriksson et all 120031 ) were available 



POINT-AGAPE used the wide-field camera of the 2.5m Isaac Newton Telescope 
on La Palma to survey a 0.6 deg 2 area of the M31 disk and bulge. Around 65% 
of the LT/FTN reference field overlaps with the PA data. All Sloan i'-band data 
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from the PA survey obtained between 1999 and 2001 have been reprocessed using 
the Angstrom Data Analysis Pipeline (Section 13. 3p . These data are very useful to 
us as they provide an extended data baseline for the subset of light curves lying 
in both Angstrom and PA survey regions to enable better discrimination between 
variable stars and microlensing events. 



Analysis 

Visual inspection of the data confirmed the presence of a clear short timescale 
spike in the 2006/7 season of LT i' data. There is also an obvious periodic 
variation of the baseline of the event which is also present in the PA data. These 
features can be seen in Figure 15.111 

It is clear from inspecting the difference images that this is a real astronomical 
event and is not caused by some exotic data processing effect. The relevant 
images bracketing the flux spike, spanning the dates 11th - 20th September 2006 
are shown in Figure 15.11 

Therefore, the event is modelled by fitting the flux with the sum of a reduced 
Paczyhski curve (with flux varying with time as given in Equation 14.21 where t 
and to are the epochs of observation and maximum brightness, B is the baseline 
flux, AF = F(to) — B is the maximum flux deviation and £fwhm is the duration 
of the full-width at half- maximum), and a skew cosinusoid function. The CERN 
MINUITg software is used to minimise the \ 2 °f the fit. The skew cosinusoid 
function used is designed to be capable of being adjusted by the fitting routine 
continuously from zero skewness (sine curve) to high skewness. The function 
chosen is defined by Equations ( 14. 3 p and (14. 4p , in Chapter HJ 



1 http : / / wwwasd. web . cern. ch / wwwasd /cernlib/ 
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Figure 5.1: Portions of the Angstrom difference images covering the main flux 
spike. The first image in the series is 11th September 2006; the last is 20th 
September 2006. The image sequence is left to right, top to bottom. The seven 
central points where the source was brightest span 14th-16th September. The 
scale of the images is given in the first image: (each sub-image is approximately 
1/12 of the area of an LT field). The red line in the top left image is roughly 10" 
in length. The white area at the bottom and left of each image is the corner/edge 
of the LT field difference images. 
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5.3.3 Fitting Details 

This event was first fitted in 2006, and was subsequently re- investigated each 
time the Angstrom pipeline was significantly improved and the lightcurves re- 
generated. 

The following describes sequentially the work that has been done to investigate 
the behaviour of this lightcurve, concentrating to a greater extent on the analysis 
of more recent data which were of noticeably better quality than the earlier data. 

Lightcurve copies 

In the DIA pipeline, new objects are defined as soon as a localised variation in flux 
appears which is not consistent in position with any previously detected object 
position. In many cases, there exist more than one defined varying object within 
the same PSF for a given epoch. As the fluxes of these objects vary independently, 
the centroid of their combined PSF moves back and forth so that it is nearer to 
the object which happens to be the brightest at that moment. Sometimes, the 
centroid of a PSF moves so far from a previously defined object that it can no 
longer be considered consistent, to within the positional estimation errors, with 
being part of the same object. At this point a new variable object is defined. 
However, the fluxes for the (now two) objects are drawn from the integrated flux 
of the whole PSF. Hence, although the objects may be positionally independent, 
photometrically they are almost identical as their individual PSFs overlap almost 
entirely. Therefore, in many cases the lightcurves of objects which are very close 
in position to one another are very similar, or sometimes identical. 

5.3.4 Investigating the variable component 

Firstly, an eight parameter skew cosinusoid fit was made to the above three band 
data, with the region (53982 < t < 54006) (JD - 2400000.5) around the possible 
Paczyhski spike masked out. The best fit parameters for this are shown in Table 
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Table 5.1: Fitting parameters for best skew cosinusoid fit to "short event" data 
in the 2007 photometry, not including possible Paczyhski spike. 



Parameter 


Value 


MINU1T 


MINOS errors 


Name 




fitting error 


(if calculated) 


00 


53210.92 


0.70 


+0.69-0.71 


P 


245.29 


0.14 


±0.14 


S 


1.155 


0.053 


+0.021-0.016 


Blt 


9.01 


0.17 


±0.17 


Bftn 


2.03 


0.19 


±0.19 


AFpTN.sin 


-7.55 


0.20 


±0.20 


-Bpa 


-18.62 


0.91 


±0.91 


A-FpA,sin 


-88.6 


1.6 


±1.6 



15.11 This fit had a x 2 °f 2237.85, which corresponds to a x 2 /d.o.f. of 4.61, as 
only 494 data points were included. All these parameters are consistent with 
the equivalent ones found using the earlier 11 parameter fit, which has not been 
quoted here. 



Periodogram Analysis 



Due to the way the annual gaps in the data occur, it is quite easy to see a 
period in the data (especially in the PA data) equivalent to roughly 750 days. 
This effect is apparently caused by aliasing between the above periods and the 
annual observing cycle as several obvious resonances can be calculated. To pro- 



vide further supporting evidence for the best fit p eriod 



subjected to a Lomb-Scarg 



Shobbrook and Lomb 



1972 



e periodogram ana 



Shobbrook et al 



ysis (ILomb 



bund, the dat a were 



1975 



Scargle 



1982 



19721 ) which can cope with non- 



regularly spaced data such as ours, unlike Fourier Analysis. The ratios found 
above between AF L T, S in, AF FTNjSin and AF P A iS in were used to scale the measured 
variable fluxes (with the Paczyhski peak still masked out) to a consistent level, 
and then the data from all three bands were concatenated and ordered in date 
order to form one combined data set. The periodograms of these combined data, 
plus the individual bands, were taken. Also, the above skew cosinusoid fit was 
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Figure 5.2: Periodogram of scaled combined variable signal for the event ANG- 
06B-M31-01. 

subtracted from the original raw data, and the residuals were scaled and com- 
bined in the same way, and periodograms again taken. The periodograms of the 
scaled combined data confirmed that there were significant periods in the data 
at P = 245.8 and P = 736.1 days as well as some other less significant periods. 
This is shown in figure 15.21 The data were folded at all significant periods found 
as another check on the periods indicated by the periodogram. The light curve 
folded at a period of P = 245.8 days is shown in Figure 15.31 

Folding the data at the most significant periods contained in the periodogram 
of the residuals to the best skew cosinusoid fit showed no evidence for a second 
period in the data. 

When the best fit skew cosinusoid has been subtracted from the original data, the 
baseline of the Paczyhski peak does indeed look flat within expected statistical 
variation. This can be seen in Figure 15.41 

The periodogram of the residuals shows that no significant periods remain in the 
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da.ta folded at pei-iod = 00245.58880 
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Figure 5.3: Plot showing the variable star component of the microlensing can- 
didate ANG-06B-M31-01. All data with flux spike subtracted; each band scaled 
by the amplitude of the skewcosinusoid fit, interleaved in time and folded at P = 
245.8 days 

data. 

5.3.5 Investigating the Paczyhski peak 

The remaining spike in the data after subtraction of the variable component 
was then modelled individually using a full Paczyhski fit in order to investigate 
whether it was possible to extract all the lensing parameters. An eight parameter 
fit was attempted, using to, tg, the Einstein crossing time, /3, the minimum 
impact parameter, 5 LTjPac , -pFTN,pao ^FTN.pac, ^PA, P ac and 5 PA ,pac- The Paczyhski 
amplitude in the PA band was allowed to vary freely this time. Obviously, if 
the subtraction of the variable had been done correctly we would have expected 
the baseline parameters to be approximately 0, and indeed they were all much 
less than 1 ADU/s. The data could be fitted fairly well with a full Paczyhski 
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Figure 5.4: The residuals after removal of the variable fit to the baseline of ANG- 
06B-M31-01 

profile, with a best fit \ 2 °f 3221.34, corresponding to a x 2 /d.o.f. of 6.29. There 
was insufficient information in the lightcurve to find a unique solution, however, 
with solutions having equally good \ 2 being found over a wide range of beta 
values. All the fits to the short event lightcurve presented in this section may be 
seen to have xVd.o.f. values which are considerably higher (i.e. x 2 /d.o.f.^> 1) 
than would normally be expected from a model fit which correctly explains the 



data. We believe that this may be 
investigation soon to be described in 



explained by the 



Kerins et al. 



acts discovered by the 



(]2009f ). This work has found 



that the intrinsic noise in the data after the difference imaging process is 40% 
higher than pure photon noise. This in turn implies that all \ 2 and x 2 /d.o.f. 
described within this thesis as fits to Angstrom data may effectively be divided 
by a factor of 1.4 2 , i.e. 1.96. In the degenerate region, 

2y/3pt E = 7.69 ±0.03. As 

/3 changes, so does the lensing amplification, and hence the original flux required 
by the fit to reproduce the rise in flux SF observed. 
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Table 5.2: Table showing the parameters of the modelled Variable component 
which was subtracted from the data before the lensing peak was modelled. 



Parameter 


Parameter 


Description 


Value 


Phase 0o 


53148.02 


Period P 


244.99 


Skewness S 


-0.875 


LT flux amplitude A\ 


-9.26 


LT flux offset AFi 


7.44 


FTN flux amplitude A<i 


-8.57 


FTN flux offset AF 2 


2.10 



5.3.6 Adding in the Maidanak data 

Early in 2008 the data from the Maidanak telescope were processed. We had 
been aware for some time that these data fortuitously spanned the event time of 
the "short event" , and so being able to utilise these data to inform the interpre- 
tation of this event and hopefully cut down the degeneracy of the model solutions 
described above was very important. 

Unfortunately, the Maidanak data have only 40 data points, spanning a little 
under 11 days so it proved not possible to do a full combined Pac + Cosinusoid 
fit for all the data simultaneously, since the amplitude of the variable component 
related to the Maidanak data was hardly constrained at all by the data. 

Therefore, a different method was required. First, the previously found (see 
description of earlier work above, and Table 15. 2p variable contribution to the LT 
and FTN data was subtracted from those two bands, to leave only the modelled 
microlensing components. As can be seen from Figure 15.51 this function was a 
good fit to the baseline variability. Nothing was subtracted from the Maidanak 
data at this stage for the reason given above. 

Then the data were fitted with a full Paczyhski curve. The resulting amplitudes 
of the Paczyhski curves of the best fit were taken as an indication of the flux ratios 
between the bands. As can be seen from Figure 15.61 there was a clear excess flux 
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on the left hand side of the Maidanak flux peak which was fitted in the first 
iteration, corresponding fairly convincingly to the downward slope with time of 
the currently un-subtracted variable component, as can be seen in Figure 1531 A 
closer zoom in on the central peak area is shown in Figure 15.7} which highlights 
the large scatter the magnitude of the error bars) on the data points in and 
after the peak. This will be discussed further later in this Chapter. 



LT(i),(MAiDAKAa[i) and mi(i)(a<s6ied)) lc 012147 -Joint p&s/skew-sm Fit 




5.34* Iff 1 



JD-24000QO (days) 



5,4x1 0" 



Figure 5.5: The lightcurve of the short event, using LT (Blue), FTN (Red) and 
Maidanak (Black) data. These are the raw data as they were before the process 
of scaling the Maidanak data to the other data had begun. 



In order to make a first order estimate of the unknown variable contribution in 
the Maidanak data, despite the short time span of those data, a constant frac- 
tion of the variable component function from the LT fit was subtracted from the 
Maidanak data, and the resulting flux re-fitted with the full Paczyhski curve. 
The constant fraction was iterated manually, attempting to minimise the value 
of x 2 °f the fit. The minimum value of \ 2 was found when the fraction was equal 
to 0.45. No flux offset of the variable component was introduced as this would 
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Figure 5.6: A moderate zoom in on the flux peak region of the lightcurve of the 
short event, using using LT, FTN and Maidanak data. These are the raw data 
as they were before the process of scaling the Maidanak data to the other data 
had begun. 

only introduce an offset into the Maidanak flux which would then be fitted by 
the Paczyhski curve fitting. By eye from Figure I5.5[ though, any offset anyway 
appeared to be small, as the Maidanak and LT points in the steep rise lay almost 
on top of each other. The FTN data were also fitted to assist with the character- 
isation of the baseline flux. As in the work above, the LT and FTN fluxes were 
linked, reducing the number of parameters by one. 

The x 2 does not change very quickly with the chosen fraction and this is probably 
to be expected given the short timescale in the Maidanak data over which any 
change in the flux gradient present due to the variable component can act. Hence 
this must be considered a very preliminary result which would almost certainly 
change given a longer Maidanak time series. 

Once the amount of variable component to subtract had been optimised, the 
resulting best fit parameters (with fitting errors) were as shown in Table 15.31 
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Figure 5.7: Plot showing a close zoom in on the peak of the short event lightcurve, 
using LT, FTN and Maidanak data, before the scaling factor or offset between 
the bands had been adjusted. 

However, when the errors in the LT data were scaled so that x 2 f° r the fit to 
the whole lightcurve = 1.0 the errors for the fitting parameters (3 and ts became 
undefined, implying that there is insufficient information in the lightcurve to 
distinguish separate values for these two parameters. Therefore it seems we must 
satisfy ourselves with a value for £fwhm- However, the new fitted values were 
similar to those found with the unsealed errors, namely, t E = 2.34, = 0.748. 
If the combination 2v / 3/%£ is then formed from these values, then the result is 
^fwhm = 6.06 days, compared to the fit using unsealed errors which gave a value 
of 5.672 days, and the £fwhm found for the earliest joint fit without Maidanak 
data which was 0.711 days. The later values found using the Maidanak data 
should be preferred over the fit to the earliest data, as adding the Maidanak data 
to the peak greatly increases the number of data points and average data quality 
over the flux peak, and should thus have improved the quality of the fit which 
was achieved. Although the full Paczyhski fit above does find values for t E and 
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Table 5.3: Table showing the fitting parameters for a Paczyhski fit for the "short 
event" using the Maidanak data. 



Parameter 
Name 


Value 


MINUIT 
fitting error 


MINOS errors 


to 


53993.346 


0.015 


±0.015 


t E 


2.240 


+0.030 
-0.024 


±0.024 


(3 


0.7310 


+0.012 
-0.0024 


±0.0024 


Blt 


-6.24 


0.014 


±0.014 


Bftn 


-1.78 


0.17 


±0.17 


BMaid 


+ 1.31 


+0.45 
-0.46 


±0.46 


AFpTN,pac 


196.4060 


5.71 


±0.0037 




191.4 


+2.3 
-5.7 


±2.3 



f3 separately which when combined give a reasonable value for tFWHM, although 
not completely consistent with the directly fitted tFWHM values, it must still be 
strongly suspected that, in reality, these two parameters are degenerate, since the 
relatively high value of (3 corresponds to an amplification of only 1.34 in the high 
magnification regime, and there can be no stars in M31 with unlensed magnitude 
bright enough to have the measured peak magnitude of this event with only this 
magnification (see Section f5.3.8j) . Therefore it must be borne in mind that the 
errors quoted in Table 15.31 are merely the fitting errors calculated by MINUIT 
for the particular data points provided to it in the lightcurve, and do not show 
the true degeneracy and thus uncertainty of the tE and parameters. It might 
be expected that if it were possible to perform a Monte Carlo study in which 
many realisations of the statistical noise on each point in the lightcurve were 
selected and the fitting re-performed, that the individual parameters tE and (3 
would individually vary to a much greater extent than their fitting errors would 
suggest, but that their product would remain much more constant, as this is the 
actual well-determined physical quantity. 

In order to display the Maidanak and LT data in a way that allows direct com- 
parison of their fits to the model, the flux data were normalised to the LT flux 
amplitude by subtracting the (small) fitted offsets and multiplying the Maidanak 
data by the peak Paczyhski flux amplitude ratio between the Maidanak and LT, 
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which was equal to 1.11. The re-scaled two band fit to the Paczyriski curve is 



shown in Figure 15.81 A close-up of the peak region is shown in Figure 15.91 The 
best fit to the Maidanak data only is shown below in Figure 15.101 
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Figure 5.8: Plot showing peak region of the best full Paczyhski fit to LT plus 
Maidanak data, for the "short event" . 



5.3.7 Rediscovery of the event in the 2008 photometry 



By searching for objects in the variable object database with the same position as 
the previously identified lightcurves, the equivalent lightcurves to the short event 
were re-found in the most recent photometry. 

These lightcurves had not been selected by the candidate selection pipeline, and 
more detailed investigation revealed that these lightcurves had failed the pipeline 
primarily on Cut 6 (bump sampling) due to not having any data points in the cen- 
tral region t ±^FWHM/4. The global x 2 /d.o.f. was found to be 3.39, considerably 
better than before, and the local x 2 /d.o.f. was found to be 5.40, which, although 
still worse than the global fit is also much better than before. As can be noted 
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Figure 5.9: Plot showing the central peak region of the best full Paczyhski fit to 
LT plus Maidanak data, for the "short event" . 

from the x 2 /d.o.f. values above, which are clearly significantly larger than ~ 1 
and which were even larger in the data from earlier iterations of the DIA pipeline, 
there still does seem to be an issue with either unexplained scatter in the data 
or incorrect estimation of the error bars. We believe there may be at least three 
possible explanations for this. Firstly, there could still be some poorly understood 
problem with the DIA process; for example a badly fitting kernel. Secondly, there 
really could be low level variations going on from physically varying objects in 
the galaxy which have the effect of enhancing the "noise" to our fits. Thirdly, 
problems can be caused by poor background fits to the galaxy surface brightness. 
Interestingly, we are not the first microlensing survey to have experienced stub- 
born % 2 /d.o.f.> 1. The OGLE early warning system has produced data which 
have x 2 /d.o.f.~ 2 for bright events, which, although not as high as our data, is 
still significant given the amount of time they have had to develop and test their 
pipeline. This issue of high \ 2 which we experience, which is also seen in my 
sample of variable stars, as shown by Figure I5.16[ has been investigated more 
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Figure 5.10: Plot showing the best full Paczyriski fit to the Maidanak data only, 
for the "short event" . 



fully and the results will be described in iKerins et al.l ( 120091 ) . It has been found 



that the intrinsic noise in the data after the difference imaging process is 40% 
higher than pure photon noise. This implies that all \ 2 an d X 2 /d-o.f. described 
within this thesis as fits to Angstrom data may effectively be divided by a factor 
of 1.4 2 , i.e. 1.96. This is a very nice independent confirmation of the information 
separately and previously shown by Figure 15.161 whose peak was found to occur 
at 1.97 ± 0.19. Now that this is known, it can be seen that the local and global 
X 2 /d.o.f. above of 3.39 and 5.40 respectively, actually correspond to x /d.o.f. of 
about 1.72 and 2.76 which look much more like what what be expected for good 
fits to data. This issue has also affected the xVd.o.f. cut levels used by the 
candidate selection pipeline, set at 5 and 7, which are now known to effectively 
correspond to 2.55 and 3.57 respectively. 

The lightcurve shown with the best fits is shown in Figure 15.111 It can be seen 
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from the second panel that the majority of the data points are well fitted by the 
model, but the two worst fitted points are consecutive and both on the right hand 
side of the peak. 

The best fit parameters for the fit shown in Figure 15.111 are shown in Table 15.41 
The amplitude of the Paczyhski component of the fit in the PA data is so large, 
and the errors undefined, as the fit is almost entirely unconstrained by the PA 
data. 

It can be seen from Table I5~4l that the width £fwhm of the peak is still found to be 
very narrow, being only 1.36 days, compared to 0.71 days with the earliest 2007 
data. This timescale would make this event the shortest known microlensing 
event found in M31 to date, the next shortest being ~ 1.8 days reported by 



Auriere et al. 



(1200 lh . 
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Figure 5.11: Plots showing the microlensing candidate ANG-06B-M31-01, re- 
discovered in the 2008 photometry as objects 40763 and 45097 a) The whole 
lightcurve, spanning 9| years, with 8| seasons of data, plotted with the full 
mixed lensing + variable fit b) The central peak region, showing the fit to the 
lensing part of the fit, after the variable component has been subtracted. 
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Table 5.4: Table giving the fitting parameters for the best joint (Paczyhski + 
skew cosinusoid) fit, for the "short event" in the 2008 photometry. 



Parameter 


Value 


MINUIT 


MINOS errors 


Name 




fitting error 


(if calculated) 


00 


54131.11 


0.99 


±0.99 


P 


245.15 


0.14 


± 


S 


1.00000008 




±4.35e- 9 


to 


53993.436 


0.01 


±0.01 


^FWHM 


1.36 


+0.10 
-0.11 


±0.11 




-10.42 


0.22 


±0.22 


Blt 


-8.40 


0.22 


±0.22 




411 


+24.0 
-19.7 


±21.7 


AFpA,sin 


-71.5 


1.39 


±1.39 


BpA 


-30.6 


1.15 


±1.15 


^FpA, P ac 


37811 







From the first panel of Figure 15.111 it can be seen that the behaviour of the 
lightcurve which existed before the sudden peak has returned in the period after 
it. The previously found variable fit is still consistent with the lightcurve. This 
return to the previous baseline, although not "flat" , along with the improved fit 
to a lensing lightcurve model must be taken as strengthened evidence in support 
of the microlensing interpretation of this event. 



5.3.8 The lens mass of ANG-06B-M31-01 



The radial velocities and in some cases velocity 



dispersion of different p hysical 



components o 



M31 such a s plan etary nebulae (IMerret and et al 



ular clusters ( iPerret et all 120021 ) . o r HIT regions (Rubin and Ford 



2004), glob 



1970) have 



been measured by several authors (Ivan den Berghl . [1999J). The rotational ve- 
locity outside a radial distance of about 20 kpc has been measured as ~ 260 
km s _1 , but considering the range of the LT field, which at 4.5' corresponds 



almost exactly to 1 


£PC S 


ciuare at the d 


istance of M31, most authors, f 


□r e.g 


(Brinks and Burton, 


1984; 


Loinard et al., 


1995: 


Richstone and Shectman. 


1980 


Rubin and Ford. 


197 


0|), show a consistent picture of the rotational velocity bein£ 
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~ ±200 km s -1 at a radial distance of 1 kpc, and then rapidly falling to zero in a 
linear fashion at the centre. The velocity dispersion, which may increase the rel- 
ative velocity of lens and source stars, ha s been measured for globular clusters as 



about 138 ±3 km s 1 ( IPerret et all 120021 ). The actual relative velocity of the lens 



and source stars involved in this particular event cannot, of course, be precisely 
known, but it is interesting to consider, using average quantities, what would be 
the expected mass range of the lens star, given the measured timescale above. 
The maximum possible expected relative velocity can, however, be estimated, as 
the rotation curve is flat out to large radial distance and so twice the maximum 
value of ~ 260 km s _1 , plus a contribution from the velocity dispersion above, 
gives a value of the order of 650 km s _1 . This should only be possible for a disk 
source on the far side of the galaxy and a disk lens on the near side. Due to the 
small angle of (~ 13°) between the disk and our line of sight it is possible for ob- 
jects which are both close to the centre of the galaxy, but have radii large enough 
to have the maximum rotational velocity, to have this large a relative velocity 
difference. For example, for a line of sight through the galactic centre, assuming 
radii for both source and lens of greater than 1.25 kpc as assumed below (but in 
opposite directions), the height of both objects above the disk centre would have 
to be greater than 0.28 kpc, compared to the disk scale height assumed below 
of 0.4 kpc. Lines of sight to objects over most of the area of the galaxy will, 
however, have rotational velocities of source and lens in one direction only, and 
so will have a much smaller velocity difference. 



Equation 15 of iPaczyhskil ( 119961 ) gives a relation, in practical units, between the 
relative velocity of source and lens stars and the mass of the lens, if the Einstein 
time is known. It is (slightly rearranged for practical reasons): 



/200kms" 1 \ 

(5.1) 



where D\ is the distance from the observer to the deflector (lens), D s is the 
distance from the observer to the source, v t is the relative transverse velocity of 



5.3. The very short microlensing event ANG-06B-M31-01 



157 



source and lens, and tg is the Einstein crossing time measured in years. 

Equation 15.11 can be easily rearranged to get the predicted lens mass in terms of 
the Einstein time in days. This results in the following equation: 

M_ todays) 2 v t 2 f 1 \ 2 / 10kpc \ D s 

M & 0.214(years) 2 (200kms- 1 ) 2 ^365.2422^ \ D l ) D a -D l l ' J 

It is also necessary to make a connection between the value of which is required 
by the formula above, and £fwhm, which is fitted by the selection pipeline. It is 
almost certainly justified to assume that this event is in the high magnification 
regime, and hence Equation 11.141 applies. 

The bulge was sim ulated both as in the "exponential bulge" and "power law 
bulge" model, as in 



Kerins et al. 



( 120061 ). The bulge profile was therefore given by 



Equation 15.31 for an exponential bulge, 



_rff,2 Vb.2 ,£i,Y 

Pb = Pb,oe la qa * a J (5.3) 



where q parametrises the elongation of the bar and has the value 0.6, and a is the 
bulge scale length with the value 1 kpc. The power law index, s, has the value 
0.75. For the power law bulge model, the disk mass profile is given by Equation 



15.41 The power law index, s, has the value 3.0. 



Pb = Pb,oO--\ + — H 2 ) (5.4) 

a qa qa 

The mass distribution of the disk was assumed to be the normal double exponen- 
tial profile, 



Pd = p dfl e^ Z/H M-^ (5.5) 



where H is the disk thickness scale length taken to be 0.4 kpc and h is the disk 
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radial scale length, taken as 5.8 kpc. 



he d istance to M31 was assumed to be 784 kpc, as measured by 



Stanek and Garnavich 



(11998ft . 



Velocities perpendicular to the line of sight 

Rotational velocities of lenses and sources in the disk were assumed to be constant 
at 235 km s _1 outside a radius of 1.25 parsec. Inside this radius, they are assumed 
to act as a solid body rotator, so the rotational velocity decreases in proportion 
to radius, reaching zero at the centre. In the galactic frame, the coordinates of 
the ANG-06B-M31-01 in kpc are (-0.025,-0.805). Therefore, the line of sight 
penetrates the centre of the disk at a galactic radius of roughly 3.6 kpc, which 
means that the orbital velocity takes its maximum value of 235 km s -1 . In the 
plane of the sky, this also translates to a rotational speed of almost identical 
magnitude. 

The magnitudes of the one dimensional velocity dispersions assumed for the M3 1 



bulge and disk respectively are identical to those used in iKerins et al.l (120061 ). 
That is, where a "heavy" disk model was used, (Models 1,3 and 5), o"disk,id = 60 
km/s. For the "light" disk models 2,4 and 6, 0di s k,id was a/3 lower. Where a 
"light" exponential bulge model was used, (Models 3 and 4), this was taken as 
o"bui g e,id = 90 km/s. For the "heavy" exponential bulge (Models 1 and 2), a was 
\/3 higher, and for the power law bulge (Models 5 and 6) it was \[2 higher. In 
all cases below, a g^a = V^vid, i-e. an isotropic velocity dispersion is assumed. 

For a disk lens and source, the rotational velocities of lens and source are assumed 
to cancel, leaving only the velocity spread of the disk, Odisk- The overall motion 
was modelled using a summed velocity dispersion = v^disk^d- 

For a bulge lens and source, there is no bulk rotation- only the random motion 
described by the bulge, Obuige- The overall motion was modelled using a summed 
velocity dispersion = v^Obuige^d 
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The magnitude of the typical transverse velocity and velocity spread are calcu- 
lated by assuming an isotropic Maxwelfi an velocity distribution (such as that 



given in Equation 6 of 



Kerins et al 



(|2006l )) and integrating over both the line of 
sight and radial angle directions, leaving only the dependence on the transverse 
speed v t . 



This dependence is shown in Equation 15.6} where p is the mass density 



F(p, v t ) = v t (-^)exp-— (5.6) 

The typical magnitude of the velocity is taken as the mean speed of this distri- 
bution and the spread as its HWHM (half- width half maximum). 

When a mix of disk lens and bulge source is modelled, or vice versa, both ro- 
tational and random motions exist and so these must be combined. This was 
done by adding a rotational velocity vector and a randomly oriented disper- 
sion vector and then integrating the sum over the full range of angles of the 
dispersion vector to find the average total length. This magnitude can be writ- 
ten as (t>rof + ((2o"totai)/7r) 2 ) ' 5 - In this case the 2d velocity dispersions from 
both the bulge and the disk are combined to form the overall dispersion by 

Ctotal = VT^disk^d + ^bulgc^d)- 

Only velocity vectors are used (and not proper motion vectors) in the above 
calculation, as both lenses and sources are at approximately the same distance 
from us, and so the normal correction may be neglected. 

The calculated typical speeds and spreads, which are used as the uncertainties in 
the speeds, are given in Table 15.51 



Source-lens separations 

The typical separation of lenses and sources (D s — D{) is estimated by calculat- 
ing the mean lensing rate-weighted separation over the source and lens density 
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galaxy 


model 








1 


2 


3 


4 


5 


6 


DD 


282 ± 192 


163 ± 111 


282 ± 192 


163 ± 111 


282 ± 192 


163 ± 111 


BD 


294 ± 278 


281 ± 241 


271 ± 212 


257 ± 161 


283 ± 247 


269 ± 205 


DB 


294 ± 278 


281 ± 241 


271 ± 212 


257 ± 161 


283 ± 247 


269 ± 205 


BB 


366 ± 250 


366 ± 250 


211 ± 144 


211 ± 144 


299 ± 204 


299 ± 204 



Table 5.5: Table giving typical relative lens-source speeds in the plane of the sky 
along with their estimated uncertainties. The values are calculated for 6 galaxy 
models and the 4 combinations of disk or bulge sources or lenses. Speeds are 
given in km/s. 

distributions. This is done for all six galaxy models and the four combinations 
of lens and source locations; Disk-Disk, Disk-Bulge, Bulge-Disk and Bulge-Bulge. 
Uncertainties in these values are taken as equal to the values themselves. 

The values found are given in Table 15.61 

The values given in Table 15.61 are equal for galaxy models 1-4 as these all use 
the exponential bulge model, with various scaling factors. Since the integral is 
effectively normalised by the scaling factors, only a change in the shape of the 
bulge distribution has any effect on the final value. Hence the two power law bulge 
models 5 and 6 have slightly different values for those combinations involving the 
bulge. 

The magnitude of the event at peak 

Although there are no LT data points very close to the peak flux of this event, 
it is possible to estimate the magnitude of the event for seven of the data points 
around the peak which are the only ones in which the object is a resolved source. 

In order to estimate and calibrate these magnitudes, photometry of resolved 
sources within the Angstrom data was carried out on background subtracted 
data. The background of each image was estimated using the IRAF task rmedian 
with an inner radius of 25 pixels and an outer radius of 49 pixels. Instrumen- 
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galaxy 


model 








1 


2 


3 


4 


5 


6 


DD 


1.5330 


1.5330 


1.5330 


1.5330 


1.5330 


1.5330 


BD 


1.3648 


1.3648 


1.3648 


1.3648 


1.3809 


1.3809 


DB 


1.3295 


1.3295 


1.3295 


1.3295 


1.3488 


1.3488 


BB 


0.7666 


0.7666 


0.7666 


0.7666 


0.8533 


0.8533 



Table 5.6: Table giving the lensing rate- weighted mean values of the source- lens 
separation in the M31 bulge (kpc). The values are calculated for 6 galaxy models 
and the 4 combinations of disk or bulge sources or lenses. 



tal magnitudes were computed using the SE xtractor ( Bertin and Arnoutsl . Il996l ) 



package from within the STARLINK GAIA iDraperl (120081 ) software suite. Rel 



ative photometry was calc ulated be t ween observations and was then calibrated 



against standard stars from 



Landolt 



(119921 ) . By using the difference flux values of 
the same points to calibrate the fitted peak Paczyhski Flux, the peak magnitude 
of the event is estimated as i = +18.07 ± 0.06. 



The range of likely source stars 

In order to estimate the range of the parameter /3 which is required to produce 
the magnification necessary to reproduce the measured brightness of this event, 

the range of possible source stars was first considered. Examination of the colour- 

I 1 Jil 

magnitude diagrams which form Figure 3 of iKerins et al.l ( 120061 ) shows that the 



upper end of the Main Sequence (MS) in M31 has been modelled by us as ending 
at an R-band absolute magnitude of about —1, whereas the Red Giant Branch 
(RGB) ends at about a magnitude of —2. Correcting for the appropriate values of 
(R-I) colour, and for the distance to M31 leads to I-band magnitudes for the MS 
and RGB respectively of mj^MS = +23.47, ttii^gb = +21.87. We choose this (as 
the highest value) as the upper limit. Similar values may be obta ined by examin 



i ng th e Colour-Magnitude (CM) diagrams which are Figure 6a of 



Perry man et al. 



( 119951 ). although the plotted H p (Hipparcos) broad band (375-750nm) filter mag- 



nitudes are only partially compatible with Sloan i' band, being closer to r' or 
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g'. At the lower stellar mass and magnitude end of the CM diagram, the stars 
which have a significant probability of being sources are limited both by the de- 
creasing probability of the required high magnification event and also by peak 
magnifications beginning to be suppressed by the finite source effect, although 
the number of stellar sources increases markedly as the mass decreases. The fi- 
nite source effect begins to become significant when the radius of the source star 
R s ~ (3 Re- Therefore, (3 min ~ R S /R E , which is minimised for low R s and high 
Re- For main sequence stars, R s oc M 8 , and for low mass stars this changes to 
R s ~ O.li? , i.e. R s = constant. This means that R s does not get much below 
O.IRq. Since the size of the Einstein ring is governed mainly by the lens mass 
and the source- lens separation (see Equation 11.51) . if we assume a low mass star 
of mass 0.5M Q and radius O.li?© with a typical source-lens separation of ~ 1 kpc, 
then the value of which gives the maximum magnification can be calculated to 
be 0.00023. Since peak magnification A ~ 1//3 in the high magnification regime, 
this means that the maximum magnification for this star is ~ 4300. One could 
of course get higher magnifications by raising the lens mass to increase Re, but 
since the lensing rate scales as /3(m a5 )0(m) oc /3m~ LS , where <ft is the lens mass 
function, which decreases steeply towards higher masses. This means that if the 
lens mass is increased by a factor 100, for example, the amplification only in- 
creases by a factor of 10, but the chances of this event happening decrease by a 
factor of ~ 40000. 

Therefore the lowest values of (3 m i n and hence the largest values of the mag- 
nification can be achieved somewhere near the bottom of the main sequence. 
Hence a reasonable value to choose for the lowest mass star to consider might be 
M ~ O.5M . Knowing t hat the absolute magn i tude o f the sun is Mj = +4.08, 



(see Table 2.1, page 53, 



Binney and Merrifieldl (119981 ) ) . enables us to calculate 



that the magnitude of a solar mass star in M31 would be Mj = 28.55. Us- 
ing the relationship L oc M 3 5 which is approximately valid for main sequence 
stars enables us to extrapolate to the luminosity of a star of mass O.5M , which 
is L = O.O884L . This equates to a magnitude difference of 2.634, making its 
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magnitude Mj = +31.18. The magnifications required to achieve the peak mag- 
nitude of the event for the upper and lower limits of the above-described source 
star range are given by A min = 10 - 4 " 3 - 8 = 33.1 and A max = 10 - 4 " 13 - 11 = 176035. 

Because of the argument detailed above, this value of A max never occurs in reality 
as finite source effects would dominate. If a maximum value of 4336 is assumed, 
then, reversing the above reasoning, the maximum magnitude difference that 
could occur would be —9.09. This in turn would give an unmagnified magnitude 
for the source star of 27.16, slightly brighter than a solar mass star, in fact, 
equivalent to a star of 1.44M . This assumed lower limit on the magnification 
means that the minimum likely value of /3 is ~ 0.00023. 

Using the inverse of Equation 11.71 which is, for the peak flux, given by Equation 
15.71 for high magnifications, 



the upper limit of (3 is calculated using the lower limit on the magnification given 
above. (3 is found to range from roughly 0.0001 to 0.0302, and so the expected 
value is simply assumed to be 0.015 ± 0.015, although the actual number may 
not in fact be central to its range. 

Due to the remaining uncertainty in the fitted timescale of the event £fwhm, its 
value was taken as equal to 2.1 ± 0.8 days. Combined with the central value of 
0, this gives = 41 ± +44 days. This is formally consistent with zero, although 
we know the Einstein time must be a positive quantity. Therefore we can only 
conclude that, given the uncertainties in the method we have been forced to adopt, 
we cannot make any quantitative statement about the Einstein time, other than 
it is likely to be t& < 85 days. 



= V2 A (A 2 - 1) 5 - 1 



2 



1/A 



(5.7) 
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galaxy model 








1 


2 


3 


4 


5 


6 


Mean Mass 


S.D. 


DD 


3.50 


1.25 


3.50 


1.25 


3.50 


1.25 


2.30 


1.25 


BD 


4.25 


3.75 


3.50 


3.25 


3.75 


3.50 


3.73 


0.35 


DB 


4.25 


4.00 


3.75 


3.25 


4.00 


3.50 


3.80 


0.38 


BB 


11.75 


11.75 


4.00 


4.00 


7.00 


7.00 


7.50 


3.50 



Table 5.7: Table giving the estimates of lens mass for ANG-06B-M31-01 in units 
of the solar mass. The results are given for the 4 combinations of source/lens 
location and for 6 galaxy models. 

Estimated uncertainties 

The fractional error in the mass is calculated by propagating the errors in the 
individual variables, and by including the variations in calculated masses between 
galaxy models. 



AM 



where 




^J(^ssL)' + m)' (5.9) 

V V ^FWHM / \P/ 



Mass estimate results 



The results of the mass estimate described above are summarised in Table 15.71 
The fractional errors are calculated using only the first order approximation and 
so should only be taken as indicative of the magnitude of uncertainty, but gener- 
ally range between 2.6 and 2.9. 

As can be seen from Table 15.71 the predicted masses are generally in the giant star 
range, but with a large uncertainty. The largest contribution to the uncertainty 
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comes from the uncertainty in 0, which in turn comes from the large range of 
possible source stars, given our lack of colour information for this ev ent. Utilising 



Kerins et al. 



the ca lculations of lensing rate distributions plotted in Figure 5 of 
( 120061 ) as a guide as to which of the mass estimates above are more likely in 
practice, it may be seen that the innermost contour for disk lenses is 0.2 events 
per year per square arcminute, whereas for bulge lenses it is 2.4. A similar 
ratio would be expected if the rates were divided by the location of the source. 
Therefore the bulge lens, bulge source estimate is the most probable, followed by 
the mixed estimates, and lastly the disk - disk estimate. This also increases the 
chances of the mass of the lens being at the top end of the estimates in Table 15.71 
Therefore, if the variations in the results for different galaxy models are folded in 
with the errors in the individual mass estimates, the most likely mass estimate is 
7.5 ± 21M Q . This is formally consistent with zero, but the mass cannot be less 
than zero. So to first order, we can only conclude that the lens mass is likely to 
be less than 28.5M . Since almost all stars have masses less than this, this is a 
very weak constraint. The uncertainty of the lens mass estimate is mostly due to 
the large uncertainties in both the true values of /3 and the relative lens-source 
velocity. 



5.4 Some interesting "near misses" in the 2007 
data 

While scanning through files looking for periodic variables which could be used 
to estimate the flux ratio between the PA and LT data (see Chapter H]) two 
lightcurves, close together in lightcurve number, were noticed. Both had clear 
coherent "spike" -like deviations in their lightcurves, and did not appear to be the 
normal type of lightcurve expected to be classified as "variables". The first of 
these was LC 6530 (see Figure I5T21 . The selection pipeline merely classified this 
lightcurve as a "variable" , with no mention in any of the first iteration performed 
of even attempting to fit a Paczyhski or mixed Paczyhski + Variable model. This 
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seemed strange, given how clear was the short duration flux variation at the end 
of the third season of LT data. Therefore, further investigations were carried out, 
to establish the reasons for this. In the first iteration, no mixed fit was performed, 
due to the lightcurve failing the criterion required by Cut 2. This was because the 
long low bump, containing many data points, spanning time points 3684.97 to 
3974.06 (JD-50000) had a lower, (but not much lower) accumulated \ 2 than the 
bump which is the most obvious in flux, spanning 4106.96 to 4153.86 (JD-50000). 
A major contributing factor to this was that the earlier wide bump contained 34 
data points, while the later sharp bump only contained 13. This fact, and the 
observation that it would clearly be desirable to attempt to fit a mixed fit to 
lightcurves containing this kind of obvious spike and a variable baseline, meant 
that the criterion to decide whether the biggest bump was significantly larger 
than the next was changed to use the quantity Xbum P / n p where n p is the number 
of data points in the bump, as defined by the ordinary bumpfind routine. Thus a 
long low bump which has the same integral underneath it as a short high bump 
would no longer be "equivalent" in the eyes of this criterion. Always bumps which 
accumulate \ 2 & t a greater rate would be preferred. Using this new criterion, a 
mixed fit was indeed fitted by the pipeline, and produced \ 2 ~ 7900 compared 
to that of the pure variable fit of ~ 8200. 

13(1) ,PaS{i) Bralaii flraen LC *H658fl -Skeir^CoBiniiBoljd Fit ItBraHnn - 1 




isdo ssoao Mao mm sgh 4dod 

JD-345DO0O.5 [dnj») 



Figure 5.12: Plot showing the first iteration periodic variable fit to lightcurve 
6530. 
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Figure 5.13: Plot showing two different aspects of the lightcurve 6530 in the 2007 
photometry: a) the best second iteration reduced Paczyhski fit to lightcurve 6530, 
after the first variable component has been subtracted in the first iteration b) the 
central Paczyhski peak region of the second iteration fit. 

However, the final classification was still as a variable. Further investigation 
showed that the to of the best fit Paczyhski bump was being placed somewhere 
in the PA data, not where naively expected in the flux bump at the end of the 
LT data. It was presumed that this is because a single period sinusoid is not 
sufficient to correctly model the baseline variation of this lightcurve, as indeed 
can be seen from Figure I5.12[ especially in the LT data. In this situation, the 
X 2 minimisation routine will still try do its work and has found that a Paczyhski 
bump can more effectively be used to match to one of the remaining variable 
bumps and hence reduce the size of its residual, rather than reducing the peak in 
the LT. In confirmation of the observation that a single sinusoid is insufficient to 
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model this lightcurve, the output of the periodogram was noted to have detected 
two clear periods; of 150.6 days and 688.2 days. As a knock-on effect of this un- 
expected placement of the Paczyhski component in time, the fitted ratio between 
the PA and LT variable component flux amplitudes was found to be only 1.68, 
which caused the lightcurve to fail the "unrealistic flux ratio" condition which in 
this case had a lower limit of 2.63, and therefore be downgraded in classification 
to a "mere" variable. Given that the baseline characteristics of this lightcurve 
are more complicated than the most complex model available in the pipeline 
to attempt to fit it, it was concluded that the behaviour of the pipeline in the 
first iteration was not in fact incorrect and was to be expected. To model this 
lightcurve correctly, it would be necessary to use the "normal" bumpfind routine, 
which is able to find more than one bump, on second and subsequent iterations of 
the pipeline. This is because the "large- fdif cluster" routine which by default is 
used to evaluate the criterion for Cut 1 after the first iteration is only capable of 
finding the one largest deviation. Therefore is it normally not possible to find a 
variable on the first iteration and a mixed fit on the second, as this requires more 
than one bump to be found on the second iteration. It would be better if it were 
possible to find a mixed fit on the first iteration and then deal with any remaining 
variability on the second iteration, but this is apparently not possible within the 
current framework without intervening to force the Paczyhski peak to be within 
the range of the short flux spike, even if it meant that the resulting solution was 
not the lowest x 2 one th & t; could be found. This has not been implemented, as it 
has been a principle in constructing this pipeline to allow the fitting to find its 
own fits as much as possible and not to attempt to influence the results through 
more complicated methods which might not themselves always work in every sit- 
uation. Fitting only the sinusoidal variable on the first iteration, as was finally 
chosen, is not ideal either, as it can be seen from Figure l5.12l that this fit has been 
skewed, especially in the LT, by the influence of the high flux spike. The option 
to use the first iteration bumpfind routine instead has been included as an option 
in the pipeline, but is not used by default as it was felt that a combination of 
two variable components and a Paczyhski component would be too flexible and 
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could fit many different lightcurves, without necessarily being the correct model. 
As a tool to be used in individual cases, (such as this) where inspection by eye 
can be used as a guide, then perhaps this technique could be of some use. 

On the second iteration, having removed some of the baseline variation by sub- 
tracting the first variable component, the pipeline did indeed manage to find the 
expected simple Paczyhski fit to the flux bump at the end of the LT data. This 
had a global (whole lightcurve) x 2 /d.o.f. of 8.66, but a local (just the central 
peak) x 2 °f 1200.8! As can be seen from the second panel of Figure I5.13[ most 
of the points in the central region of the peak are a surprisingly good fit to the 
model with the exception of the point at time 4116.85 which is 31.4 times its own 
error bar above the model fit, and therefore contributes 986.7 out of the total \ 2 
of 1200.8. In addition, if one looks at the first panel of Figure I5.13[ there are 
several points to the right of the main peak which are clearly above the model. 
Taken together, the three points of evidence that 1) There is a highly discrepant 
point which is earlier than the fitted peak, 2) The flux appears, from the few 
data points available to have at least two bumps and 3) The part of the declin- 
ing lightcurve covered by the end of the data is clearly above the model, point 
towards the best explanation for this lightcurve being a classical nova. Clearly 
this would have been easier to determine if the event had not been so close to the 
end of the season and there had been more points in the tail of the lightcurve. 
However, if this event was indeed a nova, then it certainly declined very rapidly 
indeed. Some idea of the timescale can be gained by the best fitting Paczyhski 
curve FWHM timescale £fwhm = 2.17 days. For completeness, the second itera- 
tion Paczyhski fit also failed two more of the pipeline cuts in addition to the local 
X 2 one. Firstly, it failed the bump time sampling cut as only 4 data points are 
within ±2t FW HM of to, and, for related reasons, failed Cut 9, as the significance 
of the peak was shared by too few (< 5) data points. 
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5.5 Results of Pipeline 
5.5.1 Variable stars 

All the analysis of variable star candidates in this section was made using the 
data from the 2007 photometry. As mentioned previously, two versions of the 
pipeline were maintained, one in which a constant ratio was maintained between 
the flux amplitudes of the cosinusoidal function in the PA and in the LT and 
FTN data fits, and another in which this ratio was allowed to vary freely as the 
PA and LT and/or FTN cosinusoid flux amplitudes were all independently fitted 
variables. 

Experimentation during the writing of these codes had confirmed that the use of 
a greater number of fitting variables, and/or more data points caused the fitting 
routine to run slower. Also, due to the larger dimensional space which must 
be covered by the minimisation routine, more parameters lead to it being more 
difficult for the routine to find the global minimum in x 2 space. In other words, 
the efficiency of the fitting routines at finding the best solutions decreases as 
the number of fitting parameters increases. When using the freely varying flux 
amplitude version the solution found, if it is the global minimum, will represent 
the true lowest x 2 an d hence the true best fitted flux ratio as opposed to fixed 
flux ratios which will usually only find solutions which are close to the best. So, 
both versions of the code have their own advantages. 

Therefore, to maximise both the number of variables found and the accuracy of 
the minimum x 2 solution for a given lightcurve and hence the accuracy of the 
fitted flux ratio, several runs of the pipeline were performed using both versions 
of the code, and several different fixed flux ratios. In total, four runs were used to 
produce the results described in this section, as well as the results for microlensing 
for the analysis of the 2007 photometry data. Three of these were fixed flux 
ratio runs, using values of the ratio designed to span the most probable range of 
this quantity found from the work in Section 14.6.11 and plotted in Figure 14.101 
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After the runs of the main pipeline had been made, the lightcurves which were 
classified as either "variable" or "mixed" were selected and the fitted flux ratio 
calculated. Information about the other fitted parameters such as period and 
skewness was collated from the output files of the main pipeline runs and then 
the data were outputted to files, binned according to LT flux amplitude, with the 
width of each bin being equivalent to 1 ADU/s, in a similar way to Section 14.6. II 
Some details of the exact operation of the variable lightcurve selection scripts 
will now be described. The first selection performed on lightcurves by the codes 
was on their classification as either "variable" or "mixed" by the main pipeline. 
It is an important point to note that for "mixed" events it would not have been 
desirable while specifically investigating the properties of variable stars to have 
accepted lightcurves which had a significant modelled microlensing component 
as this might have skewed the modelling of the variable component parameters. 
Therefore only "mixed" events which had failed the main pipeline cut on lensing 
signal to noise were chosen for this part of the work. Therefore any contribution 
from the microlensing component to the fit was small. No specification was 
made at this stage on the signal to noise of the variable component, as one of the 
things to be investigated was the distribution of fitted flux amplitudes. Therefore 
even lightcurves with very small fitted amplitudes (which had a higher chance of 
being spurious) were admitted. As will be seen later, these very low signal to 
noise lightcurves turned out to be rare, (as also seen in Section |4.6. ip . so their 
contribution to distributions of other fitting parameters was minor. The next 
selection cut made was to require the reduced x 2 °f the fit to be less than 15 as 
in the main pipeline (Cut 5). This was deliberately chosen to be a loose cut, as a 
good range for the later investigation of the distribution of reduced x 2 among the 
selected lightcurves would be required. It would always be possible to impose a 
stricter reduced x 2 cut at a later stage, if required. As in Section 14.6. 1\ a cut was 
made on the number of data points in each data band, also using the same cut 
values as previously, namely: 1) total number of points > 250, 2) PA points > 80. 
In the case of freely fitted flux amplitudes (and hence flux ratios between PA data 
and other bands), a cut was made that the flux ratio should be < 100.0, which 
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was designed to cut out the very small number of attempted fits to extremely 
noisy data, or where the lensing and variable component amplitudes had derived 
abnormally large values (perhaps largely cancelling one another out in magnitude 
to fit a particular lightcurve). 

Events rejected by the main pipeline as having too long a period and too low a 
lensing signal to noise were allowed through into the sample of "mixed" events to 
allow fair comparison between "variable" and "mixed" parameter distributions. 
It was good to note from the point of view of lensing selection that several of 
the newly allowed lightcurves did indeed form part of the extreme outlier tail, for 
example in their extremely large values of period and flux amplitude, although 
others did lie in the body of the main distribution. As shown in Table 15.91 the 
numbers involved in this change turned out to be small. 

Four versions of this script were also made, each designed to select out a different 
set of lightcurves, namely those where only LT or FTN data existed, or where 
PA data also existed, or lightcurves classified as "mixed" or "variable". Hence 
16 (=4x4) separate groups of lightcurves were initially maintained to allow the 
similarities or differences in the statistics of the fitted parameters in these different 
groups to be investigated. The reason for keeping the lightcurves with differing 
combinations of data bands separate was to investigate whether, and if so how, 
the extra baseline length gained by adding PA data changed the periods that 
were found. 

The numbers of lightcurves selected in each of the 16 groups are shown below in 
Table 15.81 The numbers do not vary with changing fixed flux ratio when there 
are only LT data, as in this case the flux ratio between PA and LT (for example) 
is irrelevant to the fitting. 

To allow visual comparison between the 16 sub-divisions, ((LT+FTN) data only 
or (LT+FTN+PA) data, Variable or Mixed fit, Fixed flux amplitude ratio low, 
medium, high or unfixed) and to establish that combining the groups did not de- 
stroy information or add two or more inconsistent populations, the distributions 
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Table 5.8: Table showing the number of lightcurves classed as either "variable" or 
"mixed-(low lensing component)" for the four selection pipeline runs, subdivided 
into telescope data source groups. 





Ratio 


LT and/or 


FTN data 


PA+LT and/or 


FTN data 




PA/LT,PA/FTN 


Variable 


Mixed 


Variable 


Mixed 




7.89,8.53 


792 


372 


303 




700 


Fixed Flux ratio 


11.10,12.00 


792 


372 


303 




736 




14.80,16.00 


792 


372 


304 




728 


Variable Flux ratio 




651 


258 


187 


444 



of period (plotted as log (P)) and skewness parameter, S were plotted. In each of 
the summary plots the three vertical columns also first divide the data into two 
parts, for low (0-5 ADU/s) flux amplitudes and higher flux amplitude ratios and 
then in the third column show the undivided total distributions. This division 
is made because the most obvious differences in the plotted distributions usually 
occur between the two groups illustrated; it might be more accurate to say that 
the low amplitude variables differed from the rest, but they also dominate hugely 
in number/ADU. The equivalent distributions to the skewness distributions were 
also plotted as fractional rise-time for each variable candidate, (where rise-time 
is defined as the fraction of one period taken up by the time taken to rise from 
the lowest flux point to the highest). It should be remembered that the skewness 
parameter can be converted into a fractional rise-time using Equation 14.51 How- 
ever, for the particular skewness distributions found the rise-time plots gave no 
additional information so are not shown here. 

An example for one only of the 4 flux amplitude ratio categories of summary plots 
for each of skewness and log (P) variables are shown in Figures 15.141 and 15.151 
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Figure 5.14: Histograms of variable lightcurve log (period) distributions: PA/LT 
flux ratio = 7.89; Rows, top to bottom: 1) 'variable' lightcurves containing (LT) 
or (LT + FTN) data only, 2) "mixed" lightcurves containing (LT) or (LT + 
FTN) data only, 3) 'variable' lightcurves containing (PA + LT) or (PA + LT + 
FTN) data, 4) "mixed" lightcurves containing (PA + LT) or (PA + LT + FTN) 
data. Columns, left to right: Flux amplitudes, 1) 0-5, 2) 5-'MAX' 3) Sum of all 
categories 'MAX' varies with row as: 1) 31 2) 68 3) 24 4) 32 . 
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Combining variable star categories 

Having established that distributions of period and skewness were reasonably 
consistent between different fixed flux ratio pipeline runs and also between fixed 
and unfixed flux ratio pipeline runs, the sixteen categories detailed above were 
combined in several different combinations to enable further investigations of 
consistency to be performed (for example between "variable" and "mixed" and 
"fixed flux ratio" and "unfixed flux ratio" classifications) and also to increase 
the number of lightcurves in each grouping to improve statistics. Six obvious 
combinations of the 16 sub-categories were formed as follows: 

1) all fixed ratio "variables" (only LT data present) 

2) all fixed ratio "variables" (at least LT and PA data present) 

3) all fixed ratio "mixed" (only LT data present) 

4) all fixed ratio "mixed" (at least LT and PA data present) 

5) all "variables" (all four flux ratios and both LT only and (LT+PA) data) 

6) all "mixed" (all four flux ratios and both LT only and (LT+PA) data) 

Having concatenated members of the group of sixteen sub groupings appropriately 
to form the above groups, the six groups above were processed to resolve the 
occasions where lightcurves were duplicated from different pipeline runs within 
the same group. There existed a substantial overlap between each pipeline run 
of the lightcurves selected and categorised, as would be expected due to the wide 
reduced x 2 window used, but clearly it was required to select the best fit from 
the maximum of four possible fits performed. Therefore the groupings above were 
consolidated by selecting only the pipeline run for each lightcurve which had the 
lowest reduced \ 2 . In Table I5T91 below the starting numbers of lightcurves in each 
of the above groups and the numbers remaining after consolidation by selection 
on lowest reduced \ 2 are detailed. This step had also previously been performed 
when the "long period low contrast" lightcurves had not been included and so 
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Figure 5.15: Histograms of variable lightcurve Skewness distributions: PA/LT 
flux ratio = 7.89; Rows, top to bottom: 1) 'variable' lightcurves containing (LT) 
or (LT + FTN) data only, 2) "mixed" lightcurves containing (LT) or (LT + 
FTN) data only, 3) 'variable' lightcurves containing (PA + LT) or (PA + LT + 
FTN) data, 4) "mixed" lightcurves containing (PA + LT) or (PA + LT + FTN) 
data. Columns, left to right: Flux amplitudes, 1) 0-5, 2) 5-'MAX' 3) Sum of all 
categories 'MAX' varies with row as: 1) 31 2) 68 3) 24 4) 32 . 
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Table 5.9: Table showing the numbers of lightcurves in each of 8 categories after 
consolidation according to lowest reduced x 2 . The equivalent numbers if the "long 
period low amplitude" lightcurves are also given for comparison. 





NOT incl. 


"long period" 


incl. "Ion 


g period" 


Grouping 


Before x 2 


After x 2 


Before x 2 


After x 2 




Consolidation 


Consolidation 


Consolidation 


Consolidation 


1. Fixed ratio, Vars, LT 


792 


792 


792 


792 


2. Fixed ratio, Vars, +PA 


910 


403 


910 


403 


3. Fixed ratio, Mixed, LT 


315 


315 


372 


372 


4. Fixed ratio, Mixed, +PA 


2148 


968 


2164 


974 


5. All ratios, Vars, LT, +PA 


2540 


1522 


2540 


1522 


6. All ratios, Mixcd,LT, +PA 


3117 


1548 


3238 


1638 



those numbers are also detailed in the table to give an idea of the relatively low 
numbers of lightcurves added by including these. Only the lines involving "mixed" 
events change because the "long period low contrast" cut is only applicable to 
mixed lensing candidates. 

Adding together groups 1 and 3 and also adding groups 2 and 4 to give purely "LT 
only" and "(LT + PA) only" data sets respectively, gave fractions of completely 
maximal allowed skewness of 34.3% and 30.7% respectively for the LT and (LT + 
PA) groups. Examining each of groups 5 and 6 (representing all "variable" and 
all "mixed" lightcurves in that order) for the maximal skewness value produced 
proportions of 36.9% and 27.0% respectively. The number of lightcurves involved 
in all of the four groups above was over 1000, so the statistics were fairly good. 
Therefore it seemed that the proportion of high skewness lightcurves was more 
strongly correlated with the categorisation into "variable" or "mixed" than with 
the length of the baseline of data. One speculation as to the reason for this 
might be that the "mixed" lightcurves, although selected to have low lensing 
contribution, might nevertheless allow the fitting routine to make use of it to 
better fit any steeper rising portion of the lightcurve, leaving less "work" for the 
variable component to do. It has not yet been ascertained whether this suggestion 
is correct. 

Observations: There are several things which may be observed from examination 
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of the histograms which are plotted, (Figures 15.141 and 15.151 being examples). 

It is interesting to see the perhaps unexpectedly large proportion of the lightcurves 
which were best fitted with the maximal allowed skewness value of 1.574. 

Since the maximum fractional rise-time explored using the cosinusoid function 
being employed for this pipeline run was only 0.356, this peak probably represents 
the integral of all lightcurves which would have been better fitted with a higher 
skewness, and there seems plenty of room between fractional rise-times of ~ 
0.05 (roughly the most extreme known) and 0.356 for this to be possible. Two 
examples of variable stars, (specifically in this case Cepheid variables) which have 
fairly simp l e ligh tcurves and fractional rise-times less than 0.356 can be found in 



O'Connelll f)1955h . 



The distribution of the reduced y 2 of all fitted cosinusoids to the above 3160 
lightcurves, this number being the sum of the consolidated totals of groups 5 and 
6 in Table |5T9~| was plotted in Figure EUS below. The distribution appeared highly 
Poissonian with a peak which was close to 2. If the centre of the peak is taken to 
be in the centre of the 11th bin then the peak occurs at 1.97 ± 0.19, assuming an 
approximate reduced x 2 error of ±1 bin width. If the error estimates for these 
lightcurves had been made correctly then it would be expected that the peak of 
this distribution would be at ~ 1, so this seemed to imply that re-scaling of the 
errors by multiplying by a number ~ \/2 might be justified, if the assumption 
was made that the model used completely explained the variations in the data. 
Alternatively, if the errors have been correctly estimated then there must be 
additional variations in the data which are not explained by the model. 

When a tighter cut on reduced x 2 °f the fits was applied (using x 2 /d.o.f.< 5.0), 
the number of lightcurves retained in the "all flux ratios, all data, all fitting 
functions" group was reduced from 3160 to 2546, (80.6% of the original number, 
which seemed a reassuringly high fraction of the whole, implying that there were 
not many "bad" fits). 
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Figure 5.16: Reduced \ 2 distribution for all 3160 variable and mixed (with low 
lensing contribution) classified lightcurves. 

Relationships and correlations between fitted parameters 



In Figure I5.17[ a clear correlation can be seen, for the periodic variables with 
periods between about 100 and 500d, between period and flux amplitude. Specif- 
ically, as the flux amplitude increases, so does the mean period of the distribution. 
This is indeed in the direction that the correlation would be expected to be since 
larger stars have longer (fundamental) periods on average, and are also brighter 
on average, and hence have larger mean variability amplitudes. There have been 
several other examples of th is sort of correlation being found. Some examples are: 
Bonanos and Stanek ( 20031 ) plotted the relationship between the period and flux 



amplitude of Cepheids as log(P ) versus loK(F amp ). A corre 



exist, but it was not very tight. 



The MACHO Collaboration 



ation was found to 



fll997[ ) also plotted 



the P-F amp relationship and described a "suggestion" of rising flux amplitude 
with rising period for bright, long period variables from MACHO survey of the 
LMC, SMC and Milky Way. This should be the same kind of star sampled by 
the stars in the main group in Figure 15.171 They also state that the RR Lyrae 
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stars found in their survey form groups at log (P/days) = —0.3 (P = 0.5d) for 
RRab stars and log (P/days) = —0.5 (P = 0.3d) for the RRc stars. These stars 
are the class of variables with the shortest currently known periods, although 



the shortest current 



( iSanwal and Sarma 



y kn own Cepheid variable has a period of only 1.24 days 



19911). 



It could also be seen, more clearly in Figure 15". 191 that the main group is elongated 
towards higher logP, especially for lower flux amplitudes, and that there was a 
hint of a separate very weak second grouping at periods above about 525 days. 
This possibility was investigated further, below. In an attempt to make the 
relationship between P and P a mp,LT in our data clearer, the data were plotted as 
a scatter plot, using log(P) (instead of P) as the x coordinate. This is shown 
below in Figure 15.171 

The main group of data between about log (P/days) = 1.6 and log (P/days) = 3.5 
appeared to follow a linear relationship with the flux amplitude, so the gradient 
of this strong correlation was calculated using linear regression. The equation of a 
line which is a good fit to all data between log (P)/days = 1.9 and log (P/days) = 
2.72 was found to be 



Pamp,LT = Gl0g(P)+ir (5.10) 

where the values of G and K were 11.92+5° 6 3 6 and -21.16^3 3 2 7° respectively. 

The scatter plot of all data with log(P/days) > 1.9, are shown in Figure 15.181 As 
can be seen in the Figure, the line which fitted best the trend formed by the major 
grouping of points was the red line corresponding to regression minimisation of 
the log (P) variations. This had equation F = 102.68 log (P/days) — 235.06. 
Equation 15.101 can also be written in the form 



p — -^Q(F am p,LT-K)/G 



(5.11) 
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Figure 5.17: Scatter plot showing the relationship of log (period) and LT flux 
amplitude for 2546 variable star candidates selected to have reduced x 2 < 5. 
Colours represent the skewness of the cosinusoid used to fit the lightcurve. Yellow 
= low skewness: 1.0 < S < 1.1, blue = medium skewness: 1.1 < S < 1.474, red 
= highest skewness: 1.474 < S < 1.574. 



which allows the expectation value of the period to be estimated given the flux 
amplitude of the variable. The calculation of the fit line through the main body of 
points was performed using only those lightcurves with 1.9 < log(P/days) < 2.72. 
These two limits correspond to the lower edge of the main group of points and 
the point which is roughly the upper boundary between the main group and the 
other possible smaller group of lightcurves which lie at periods above P ~ 525 
days. This meant that 302 lightcurves were not used because they were below 
the lower limit and 272 because they were above the higher limit. This left 1972 
lightcurves to be used in the fitting. 

When the fit given above had been found, all the original 2546 data points were 
transformed to remove the fitted dependency on flux amplitude, so that at every 
value of amplitude the distribution in Period was centred on the same value of 
log(P). This was done because in Figure 15.181 the distribution in cross section in 
the log(P) direction seems to be of a consistent width, and so could be more fun- 
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Figure 5.18: Scatter plot showing the relationship of log (period) and LT flux 
amplitude for the main group of variable star candidates with log (period) > 
1.9 and the best fit line to the strong linear correlation. Colours represent the 
skewness of the cosinusoid used to fit the lightcurve. Yellow = near minimum, 
blue = intermediate, red = near maximum. 

damental than after it has been "smeared out" in log(P) space by the correlation 
with flux amplitude. 

This was done not with a rotation (for example, around the origin), but rather a 
simple horizontal translation of the data points, as it was not required or desired 
to change the coordinate value in the flux amplitude (y) direction. However, it 
was also not desired to move the mean value of the group of points, so this was 
corrected for afterwards. 

The translation was done by an initial amount which "pivoted" around the best 
fit line's point of contact with the log(P) (x) axis. Then the difference between 
the log(P) value of this intercept and the original mean value of log(P) was added 
back to bring the (now vertical) distribution back to approximately its original 
position in log(P). Hence, in summary, if (f> is the angle between the line of best 
fit and the vertical (i.e. = ^ — 6* where 9 is the angle corresponding to the fitted 
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gradient above) the correction applied is shown in Equation 15.131 



log (P/days)' = log (P/days) - F tan(0) + F tan(</>) , 
F' = F 



(5.12) 



This is equivalent to rotating by an angle around the mean of both log (P/ days) 
and F but maintaining the F values. 

The scatter plot of "shifted" log(P) versus F after the dependence of log(P) on 
flux amplitude has been removed is shown in Figure 15.191 
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Figure 5.19: Scatter plot of "shifted" log(P) versus variable flux amplitude after 
the correlation of log(P) and flux amplitude has been removed by transforming 
log(P). 

After this transformation had been applied, the histogram of the "shifted" log(P) 
distribution was re-plotted. This is shown in Figure 15.211 with the equivalent 
figure before the transformation given in Figure 15.201 for comparison. Since the 
transformed angle is not large in log(P)-P space, the difference is not huge, but 
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the peak of the distribution has clearly and predictably shifted downwards in 
the shifted log(P) direction and become more rounded. The first result that 
this process has provided is the "true" distribution of log(P) at any constant 
value of P; in other words the horizontal cross sectional distribution through the 
main group of points. The second motivation for doing all this was to investigate 
whether the second group of points which occupy the area with P >~ 525 days 
are a second true grouping of variables or a grouping caused by some feature 
of the pipeline program. It was anticipated that if the first option were true, 
then their distribution would become "sharper" when the same transformation 
was applied to them as to the main group, since they should follow the same 
relationship as that group, and would now not be smeared out as much in log(P) 
space. However, it can be seen in Figure 15.211 that in fact the opposite has 
happened, and the previously seen peak has become more smeared out, rather 
than less. This is apparently indicative that the original group with P >~ 525 
days did not exhibit any clear trend with F and were on average at a constant 
value of log(P). This seems to make it more likely that they are a grouping which 
is caused or accentuated by the pipeline program rather than being grouped for 
some physical reason. This does not mean that these lightcurves, which after all 
comprise 8.6% of the selected lightcurves, are not valid variable stars, but that 
the log(P) distribution in this area should not be over-interpreted. Also, the 
statistics are still low, and it may still be possible for features in the distribution 
of log(P) versus F to appear, change or disappear given more or better data. 

The relationship between period and skewness was also investigated, as this 
seemed an interesting area to look for possible new correlations. No significant 
correlations were found in any of the above 16 smaller groups, or in the combined 
sample, so the plots are not presented here. The only visible systematic cor- 
relation was that already investigated above, in which lower variable amplitude 
lightcurves are found on average to have lower periods, which is also shown by 
Figure 15.171 above. 

The spatial distribution of all selected "variable" and "mixed" objects with cosi- 
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Figure 5.20: Histogram of log(P) before the main group of lightcurves has been 
transformed in log(P) to remove the dependency on flux amplitude. 



nusoid fits having reduced x 2 < 5 were plotted in Figure 15.221 

An additional dimension for investigation was made available by using the variable 
flux amplitude. The spatial plot was coarsely divided into four flux amplitude 
bins (represented by the different colours in Figure I5.22p in order to investigate 
whether the spatial distribution changed significantly with variable amplitude. 
Consistently in all the spatial distribution plots presented below, the colour codes 
are defined as follows: LT variable flux amplitude, F : 
Yellow: F < 5.0 ADU/s, 
Green: 5.0 < F < 10.0 ADU/s, 
Blue: 10.0 < F < 20.0 ADU/s, 
Red: 20.0 < F ADU/s. 

The dark line, which is inclined at 37.7° to the y axi s, represents the orient ation 



de Vaucouleurd (119581 ). and 



or "position angle" of the M31 disk, as measured by 
the smaller ellipse represents one possible model for the orientation of the M31 
bulge, represented as a triaxial ellipsoid being 0.955 kpc along the long axis of 
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Figure 5.21: Histogram of "shifted" log(P) after the main group of lightcurves 
has been transformed in log(P) to remove the dependency on flux amplitude. 



the ellipsoid. The particular model illustrated is taken from lStarkl ( 119771 ). namely 
the model with (f> = 44°. Although the original model had the length of the long 
axis of the ellipsoid equal to 2.76 kpc, this has been halved to fit on the scale of 
the LT field. The orientation of the ellipse describing the outside surface of this 
ellipsoid, as see n by us, is 10° below (on the left) the M31 disk, consistent with 
Lindbladl (119561 ) . The pair of less dark lines represent the major and minor axes 
of an ellipsoidal bulge which appears to the observer to be oriented 10° below (on 
the left) the M31 disk. 



Several features can immediately be observed in Figure 15.221 Firstly, the density 
of points in general increases markedly towards the centre (0,0) of the galaxy. 
This is clearly consistent with the increasing density of stars towards the centre 
of the galaxy. Secondly, the central concentration of variables can be seen to 
increase as the magnitude of the flux amplitude increased. This is consistent with 
variable stars with greater variability being comparably more easily observed near 
to the centre of the galaxy, where the photon noise from the subtracted galaxy 
background is the greatest. The converse of this statement is to notice that there 
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Scatterplot of periodic variable LC positions, colour-coded by LT flux 




x (arcsec) 



Figure 5.22: Spatial distribution of all 2546 variable and mixed lightcurves se- 
lected to have reduced x 2 < 5.0. 

is a relative lack of yellow (relatively low amplitude) variables observed in the 
central region. This may be explained by their variable signal being swamped 
by the high photon noise in the centre. Thirdly, there appears to be a clear 
horizontally-oriented gap in the data at a y value of roughly —160 to —170". The 
reason for this is not clear, as any dust lane would be expected to be oriented 
at the same angle as the disc, i.e. about 37.7 degrees to the vertical, but, if the 
position is compared with the bottom edge of the objects which contain PA data 
seen in Figure 14.91 the y coordinate appears similar. The reason the overlap of 
LT+PA data as shown by the coloured points in Figure 14.91 is limited on the 
bottom edge is because of the edge of the LT data, not the PA, so it is possible 
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that the gap mentioned above coincides with the most frequent position of the 
edge of the LT frame, which is masked out due to image defects on the edges of 
images. 

Finally, the general distribution (looking, for example, at the blue points) does 
seem to be elliptical, as expected (remembering the x and y scales on the plot 
are different!). The apparent ratio of y radius / x radius appears to be ~ 2.0, 
e.g. ~ 50" in y compared to ~ 100" in x, as shown by Figure 15.251 This ellipse 
does not, by eye, appear to be oriented in the same direction as the M31 disk 
line, and does seem to be lower than it on the left of the centre, as expected, but 
to a greater degree than might be expected. This is investigated further below in 
Section 15.5.11 

To further investigate point one above, the variables are grouped according to 
their flux amplitude, in bins of size 1 ADU/s, up to a maximum of 16 — 17 
ADU/s. Then the spatial distributions are plotted for each amplitude bin. The 
boundary between the distribution being concentrated around the edge of the 
area plotted (with a noticeable central hole) and it being centrally concentrated 
on the galaxy (with no central hole) appeared to be at around 4 — 5 ADU/s. To 
illustrate this, the three groups spanning this flux amplitude are presented below 
in Figure 15.231 From the full range of these 16 plots there are several examples 
of non-random-looking distributions involving clusters of points, implying that 
perhaps these represent real transverse distributions which vary in detail from 
the global average "symmetrical, centrally concentrated elliptical" distribution. 

It is also interesting to plot the spatial distributions of variable stars grouped 
according to period, to see whether the obvious groups on the Period- Amplitude 
plots varied in their spatial distributions and so might or might not be separate 
physical populations. Four groupings according to period are shown in Figure 
15^31 being P < 10 days, 10 < P < 30 days, 30 < P < 100 days and 100 days 
< P. These divisions were guided by the apparent concentrations of lightcurves 
in period that appeared in Period- Amplitude plots such as Figure 15.191 




Figure 5.23: Spatial distribution of variable stars with flux amplitudes in the 
ranges a) 3-4, b) 4-5 and c) 5-6 ADU/s, showing the cross-over region between 
central dominated and central sparse distributions. 
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Orientation and Ellipticity of the distribution of variable objects 

One of the earliest things that was done in attempting to investigate the orien- 
tation of the M31 bulge was to attempt to better visualise the data. To this 
end, contour plots of the density of variable candidates in the LT field were con- 
structed, by binning the data in square bins. With a quite coarse bin size of 40", 
some idea of the underlying structure could be gained. The resulting contour 
plot is shown below in Figure 15.251 The edges of the data affected the contours 
nearest to the edge, but there was sufficient central data to show a clear central 
quasi-elliptical peak (elliptical within the limits of the coarse binning), which ap- 
peared to be oriented in an almost horizontal direction with respect to the edges 
of the field, surrounded by (especially in the lower left) other contours which 
seem to lie in a similar direction as is expected for the M31 disk (shown by the 
diagonal line on the plot). Examining the three innermost contours surrounding 
the central peak, the outermost of these, at the 50 contour level, is roughly 220" 
in diameter in the x direction and 110" in the y direction. The next innermost, at 
the 65 contour level, is roughly 180" in x and 70" in y. The two ratios of elliptical 
major axes divided by minor axes (a/b) derived from these two contours are 2.0 
and 2.6. It may also be observed from Figure 15.251 that there appear to be two 
central peaks, and both of them are centred about 10 — 15" above the zero in 
the y direction. The gap between the two peaks appears to be consistent with 
the expected incompleteness of the survey in the very central regions of the field. 
This is investigated in more detail below in Section 15.5.11 

Investigating the azimuthal distribution 

The second investigation that was performed was to divide the field into circular 
annuli and then to divide each annulus into azimuthal sectors and to bin the 
objects in that direction also. The aim was to see if any coherent trend could 
be detected in the angle of the azimuthal sector which had the most objects in. 
If so, then the distribution of objects could be confirmed as non-random, and 
could therefore contain potentially useful information. If the assumption is made 



5.5. Results of Pipeline 



191 




Figure 5.24: Spatial distribution of variable stars with fitted periods in four 
different ranges- (left to right and top to bottom): a) below 10 days b) between 
10 and 30 days c) between 30 and 100 days d) greater than 100 days. 
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Contour Plot of spatial distribution of variable stars 




300 



Figure 5.25: Contour plot showing the way the density of variable stars changes 
over the LT field. The data were binned using 9 x 40" bins in each direction. 
The diagonal line represents t he or ientation of the M31 disk, as measured by de 
Vaucouleurs (jde Vaucouleursl . Il958l ) . 



that the distribution of objects on the plane of the sky may be modelled by 
a two dimensional elliptical Gaussian distribution with some unknown vertical 
(number) radial dependence, and this distribution is integrated around circular 
annuli, it would be expected that each annulus would possess two diametrically 
opposite peaks corresponding to the major axis of the elliptical distribution, along 
which the higher contours project relatively further out into a particular circular 
annulus than on the minor axis, where the contours would be more radially 
compressed. This hypothesis was tested by dividing the field into circular annuli. 



The maximum radius of the outermost ring was set at 200", close to the left 
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hand edge of the data, but well inside the data point with the largest radius, 
which lay at 281". Different combinations of numbers of sectors and annuli were 
experimented with to find the best compromise between angular and radial reso- 
lution and statistics in each bin, with the aim of getting the maximum amount of 
information out of the available data, without straying into the regime of noise. 
Combinations which produced reasonable results were (16 sectors, 4 rings), (32 
sectors, 4 rings), (32 sectors, 8 rings) and (64 sectors, 4 rings). With either more 
rings or more sectors than this, the statistics of each bin were such that a clear 
peak could not easily be defined due to the \fN noise dominating any signal. 
The criterion used to decide whether a particular ring had good enough statistics 
to be selected as a data point was in two parts, "peaks" in the data were first 
defined as an angular region having continuously falling number density on either 
side of a local maximum. Then the numerical difference between whichever was 
the bin with the highest number of objects in it and the bin corresponding to 
the highest bin of the second highest peak had to be greater than the sum of the 
square roots of the two numbers involved. In other words, the largest peak had 
to be clearly highest by the sum of the two Poissonian errors of the two highest 
peaks. Secondly, to avoid ruling out any ring which happened to detect the dia- 
metrically opposite peak that would be expected on the other side of the galaxy 
centre at an almost equal level to the first, secondary peaks were only considered 
if they lay within ±90° of the largest peak. Many rings contained data which 
were clearly highly correlated with rings around them, which increased confidence 
that the peaks found were non-random (i.e. physical) phenomena. Applying the 
above two criteria strictly to individual rings with the above four sets of sectors 
and rings led to 9 sufficiently clear data points (out of a possible 20). Due to the 
incomplete angular coverage of the data (due in turn to the edges of the rectan- 
gular LT field) not all angular bins contained points and those which intersected 
the edge of the data contained fewer points than they would have done in an 
infinite field. Rings with smaller radii had better angular coverage, being further 
from the edges of the data. In order to make best use of data not selected in the 
first selection process, and to gain as much information as possible on the inner 
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Table 5.10: Table listing the data points derived for the investigation of the 
azimuthal angle of the peak of the azimuthal number density distribution of 
variable star candidates. 



Number 


Number of 


Rings 


Peak 


Peak 


Angle 


Mean 


Radius 


of Rings 


Sectors 


Used 


Sector 


Angle 


Error 


Radius 


Error 


16 


4 


1 


9 


163.550 


11.2500 


175.0 


25.0 


16 


4 


2 


9 


163.550 


11.2500 


125.0 


25.0 


16 


4 


3 


9 


163.550 


11.2500 


75.0 


25.0 


16 


4 


3 + 4 


9 


163.550 


11.2500 


50.0 


50.0 


32 


4 


1 


17 


157.925 


5.6250 


175.0 


25.0 


32 


4 


3 


18 


169.175 


5.6250 


75.0 


25.0 


32 


4 


1 + 2 


17,18 


163.550 


11.2500 


150.0 


50.0 


32 


4 


2 + 3 


18 


169.175 


5.6250 


100.0 


50.0 


32 


4 


3 + 4 


18 


169.175 


5.6250 


50.0 


50.0 


32 


8 


1 


17 


157.925 


5.6250 


187.5 


12.5 


32 


8 


2 


18 


169.175 


5.6250 


162.5 


12.5 


32 


8 


5 


17 


157.925 


5.6250 


87.5 


12.5 


64 


4 


1 


33 


155.125 


2.8125 


175.0 


25.0 



regions of the galaxy, some rings adjacent to one another which failed the above 
tests individually were co-added to increase the statistics in the resulting wider 
ring. The errors on the mean radius were, of course, increased in proportion. 
These summed sets of bins were then also subjected to the above tests. Only 4 
further data points could be derived in this way, making a total of 13. It should 
be noted, that because binning the data in these various different ways does not 
invent new data, the 13 points are not completely independent, and should be 
expected to be significantly correlated. However, because using a greater number 
of angular bins narrowed the angular error (usually at the expense of the best 
attainable radial error) and vice versa, using different combinations of bin sizes in 
the two directions better defined the limits of the useful information which could 
be extracted from this data set. The final thirteen data points, with their limits, 
are given below in Table 15.101 

It can be seen that there is a tight correlation in the angles in each ring which 
contained the most data points. The data clearly cluster around ~ 161°, measured 
(as in all this work) from the positive x axis, which corresponds in turn to ~ 33° 



5.5. Results of Pipeline 



195 



above (on the right) the angle of the M31 disc measured by lde Vaucouleurd (119581 ). 
The best fit constant value for the angle corresponding to the peak number density 
was found to be 161.3°(±1.61° fitting error). 

Two examples of azimuthal distributions, as investigated above, plotted as his- 
tograms for differing radial bin size and angular bin size, are shown in Figures 
15.261 and 15.271 The main peak can clearly be seen to appear in the three outer 
radial annuli in both plots. 

In order to correctly interpret the information gathered above and apply it to the 
investigation of the bulge of M31 it would be necessary to know the completeness 
of our survey for the types of variable objects selected here. At this stage, this 
information is not known. If it were, and it could be proved that the variable 
objects used here are indeed variable stars and they are located in the bulge, 
then the information contained in the spatial distributions of objects described 
in this section might be used as a proxy for the more general distribution of 
matter in the inner M31 bulge. Given what is currently known, however, it is 
only possible to conclude that the distribution of variable objects selected by the 
candidate selection pipeline is centrally concentrated, with what appears to be 
a central hole probably due to the incompleteness of our survey, and that the 
peak of its radial distribution do e s not align either with the angle of the outer 
disk me asured by 



by e.g. 



Lindblad 



de Vaucouleurd (119581 ) or with the angle of the bulge measured 



1956). We cannot conclude that the actual distribution of 



variable stars is inconsistent with either of these orientations, due to our lack of 
knowledge of the completeness of our survey. The radial distribution of variable 
objects, and specifically the form of the central hole in the data, is investigated 
in the next section, to gain what knowledge can be gained at this stage about the 
central incompleteness. 
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Radial distribution of variables and central completeness 

As can be seen from Figure 15.221 the centre of the galaxy, and hence that of 
the spatial distribution of variable candidates, is not central to the mean image 
frame position. This made investigating the radial distribution a little more 
complicated than would otherwise be the case. It was decided to divide the space 
into concentric circular annuli, this time of equal area, since then the numbers 
of objects within each ring could be fairly compared. It was noted that out to a 
radius of about 60" the distribution is not affected by the edges of the data, and 
further out in radius to about 120" the circular annuli are only crossed by the 
edge of the data on the right hand side. At greater radii than this, corrections 
to the number distribution would become much more complicated due to the 
uneven shape of the data distribution, and by intersecting the corners. Noting 
the above two groups of r < 60" and 60" < r < 120", the outer radius of the 
largest circle and the number of circles were carefully selected so as to make one 
circular annulus have its outer rim at 60" and another at 120". (These values were 
r max = 280" and n ring = 220). Once this had been set, the number of annuli could 
be adjusted by multiplying or dividing by a whole number in order to adjust the 
scaling of the data in each bin and hence the spatial resolution and corresponding 
statistical noise of each bin. The data inside r = 60" were taken unchanged, while 
the data in the second, outer, area were selected to have their angle 9 measured 
from the positive x axis NOT in the sector arctan(— 2) > 9 < arctan(2) which 
corresponds to being outside two lines drawn between the origin and coordinates 
(in arcsecs) of (60, 120) and (60, —120) respectively. A small amount of data had 
therefore to be discarded, but not a significant amount. The remaining data were 
scaled by the angular factor 360 _( 2 aretan(2)) rec L u i r ed to correct for the lost sector. 
In this way a non-biased sample was obtained out to r = 120". It was clear, both 
from the resulting histogram and from the original scatter plot (Figure I5.22p 
that the data were seriously incomplete in the very central region. The numbers 
in each bin continued rising, however, at a consistent and smooth rate moving 
inwards from the outer regions until inside roughly (r < 20") where a sudden fall- 
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off began. Since no clear rounding off of the rise in numbers could be detected 
(with these statistics) outside r = 20" it seemed reasonable to attempt to fit an 
appropriate function to the data outside this radius in order to quantify the way 
the inner completeness chan ged with radius. The appropriate function chosen 
was de Vaucouleurs "r^" law f ide Vaucouleursl . ll948l ). generalised to indices other 



than 4 by Sersic (jSersid . Il963[ ) which has been applied both to elliptical galaxies 



and to the bulges of spiral galaxies. This function can be written: 

J( r ) = I e- bi -^- l) (5.13) 

where I(r) represents the surface brightness at radius r, the constant b is chosen 
so that half the light falls within the scale length r e and Jo is the surface brightness 
scaling constant (i.e. the surface brightness at r e ). n in this case is the "Sersic 
index", where the value n = 4 was chosen by de Vaucouleurs. For values of n > 1, 
b ~ 1.999n — 0.327. Therefore, with n = 4, b was chosen to be 7.67. The choice 
of this function is not meant to imply that the distribution of variable objects 
is being assumed to be a proxy for the shape of the bulge, but only that it is a 
reasonable choice of a centrally rising function with which to fit the rising portion 
of the histogram presented in Figures 15.281 and 15.291 

Since the incompleteness in the central regions was unknown and was the current 
object of investigation, it was not thought that the data justified varying the 
Sersic index n from 4, since the central incompleteness and the index affect one 
other circularly; only one can be chosen as the unknown. 

Making the necessary assumption that the data outside r « 20 are sufficiently 
close to being complete (although it was already known from plots such as, for 
example, the upper panel of Figure 15.231 that this assumption is not true for low 
variable amplitudes), all data points outside r = 20 were fitted with Equation 
15.131 using only two variable parameters, the scale length r e and the scaling 
factor Iq. To obtain a fit, errors in the numbers N in each bin were taken to be 
y/N. Using these errors, the best fit value of Jq was 0.57io'ii an d the best fit 
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scale length r e was found to be 1900+™", (or 32'). For the reasons stated above 
it is not very surprising that these figures are not physically meaningful. The 
histogram of radially binned data and the best fitting function above are plotted 
in Figure 15.281 

In order to investigate the region inside r = 20" in more detail, the number of 
annuli was increased to 1100, which was the largest number found to produce 
sufficiently numerically stable results in this region. This many divisions is too 
many for the outer regions, but was necessary to provide sufficient resolution in 
the central region of interest. It was found that the numbers of objects declined 
linearly, within the noise, to zero with decreasing radius. This decline was mod- 
elled by a linear function which passed through the centre of the seventh bin. 
This had gradient 0.83391 arcsec -1 . The point at which this line met the pre- 
viously fitted de Vaucouleurs function (re-scaled appropriately downwards by a 
factor of five) was found by iteration to be r = 23.065". The new histogram with 
the modelled fits in the two regions is shown in Figure 15.291 

Assuming that, in the region r < 23.065", the original number of objects with no 
decline in completeness would have risen in the same way as the de Vaucouleurs 
fit, then the radial variation of the completeness could be modelled as the ratio 
of the linear and de Vaucouleurs fit functions, which can be written as: 

C(r < 23") = mrA^ (220/n)e" 7 - 67( ^*" 1) (5.14) 

where m is the gradient of the linearly declining portion = 0.83391arcsec _1 , N(0) 
is the fitted number scaling factor = 0.5713, n = number of rings used and 
r e = 1905.6. This function is shown plotted in Figure 15.301 for the inner 50". 

It should be clearly stated that this model is only an approximation based on sev- 
eral major assumptions and that a calculation of the true completeness function 
(including its variation with the amplitude of the variable) would require a full 
Monte Carlo analysis, the scope of which is beyond that of this thesis. The func- 
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tion above, in reality, would almost certainly not equal 1 at r = 23" as it has been 
previously shown above that there is some incompleteness affecting larger radii 
than this for lower amplitude variables. Also, the details of the central complete- 
ness "hole" modelled here will vary depending on the details of the photometry 
pipeline used, for example whether the mask over the centre of the galaxy varies 
in size with each image, or is taken to be a constant. The data here selected 
as "variable stars" will not have detected all those objects in that category due 
to the algorithm used, which will not be perfect. However, using the currently 
selected data it is the best approximation that can be made to the true radial 
variation of the completeness for amplitudes of variable in this data set greater 
than about 5 ADU/s. This function should not be assumed to be applicable to 
microlensing events due to their differing amplitude distribution and selection 
criteria. 

5.5.2 Microlensing Candidates 
Results using 2007 photometry run lightcurves 

Once the four runs of the candidate selection pipeline (using three varying PA/LT 
flux ratios and also no fixed ratio) had been completed, a further script was used 
to do the selection using tighter values of the cut parameters. Doing the selec- 
tion in two parts like this had the advantage that one still has all the necessary 
information on a wider range of lightcurves if it is required to go back and change 
the cut values for any reason. As shown in Figure I5.16[ it was discovered after 
analysing the variable stars that apparently the sizes of the error bars on the flux 
points had apparently been underestimated by a factor of approximately y/(1.97). 
Therefore, using values of x 2 /d.o.f. as dictated by normal theory (originally cho- 
sen to be global x 2 /d.o.f. < 5.0, local x 2 /d.o.f. < 2.0) would have resulted in 
cuts that were much too strict and rejected many candidates that were in fact 
good enough. Therefore, the selection process was repeated using more relaxed 
values (by a factor 1.97, making them now global x 2 /d.o.f. < 9.85, local x 2 /d.o.f. 
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< 3.94) for the two cuts which relied on x 2 /d.o.f. During the course of this work 
it became apparent that in some lightcurves more than one variable period is 
clearly visible. This would be expected from experience as the probability that 
the PSFs of variable objects in the Angstrom variable object database overlap 
is non-negligible, given the density of variable objects, as can be seen in Figure 
14.11 Ideally all of these periods would be subtracted before attempting to find 
microlensing. However, in this work the "mixed" fit only contains the possibility 
of modelling the periodic baseline using a single period, and the pure reduced 
Paczyhski clearly none, and hence there will often be residual variations left in 
the baselines, which always act to increase the y 2 of a fit if the model does not 
include them. These may or may not be fitted well by subsequent residual itera- 
tions, but if the mixed fit was conducted in an iteration where there existed more 
than one periodic variation in the baseline, then the applied x 2 /d.o.f. cuts will 
still be too strict even after the error scaling described above. Since the global 
X 2 /d.o.f. fit was maintained at a higher figure than the local one anyway, to 
cater for the larger portion of baseline included in this figure, it was felt that the 
major un-catered-for effect would be on the varying baseline causing the fit to 
the Paczyhski component to be degraded. This would obviously affect fits with 
longer £fwhm to a greater extent. To cater for this possibility, the selection was 
also performed with a more relaxed local x 2 /d.o.f. of value 5, referred to in Table 
15.111 below as the "intermediate" cut levels. 

In Table 15.111 all the results for the four selection runs and for the three sets 
of cut values described above, as well as for the same "loose" x 2 /d-o.f. cuts 
employed by the main selection pipeline, are summarised. The other cut values 
employed were always constant at: "lensing ratio" > 2, roughly equivalent to 
requiring at least a 2a detection, x 2 difference ratio > 0.1 and the peak sampling 
condition as described in Section 14.51 (for both long period and "normal" can- 
didates). The value of the x 2 difference ratio > 0.1 was chosen to be relatively 
low, after some experimentation, to allow non-zero numbers of candidates to pass 
even with the stricter values of x 2 /d.o.f.. The numbers of lightcurves categorised 
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Table 5.11: Table summarising the numbers of microlensing candidates selected 
for three varying cut levels and four PA/LT flux ratio regimes, using the 2007 
photometry. 





Free Ratio 


"low" 


"medium" 


"hig 


h" 


"combined" 


X 2 Cuts 


Fpa/F 


LT = 


7.8 


9 


11.1 


14.8 




values 


norm. 


lplc 


norm. 


lplc 


norm. 


lplc 


norm. 


lplc 


norm. 


lplc 


(global, local) 






















orig.(5,2) 


2 


8 


2 


8 


1 


7 


2 


7 


4 


17 


scaled(9.85, 3.94) 


20 


28 


9 


47 


9 


49 


16 


46 


33(-i) 


93 


"inter." (9.85, 5) 


29 


40 


20 


66 


16 


68 


24 


64 


49(-i) 


121 


"loose" (15, 15) 


63 


49 


55 


103 


54 


107 


60 


103 


115(-3) 


362 



as "long period, low contrast", or "l.p.l.c." have also been included. The final 
column describes the final result after the results from the four flux ratio runs 
were concatenated, ordered by lightcurve number, and then the fit with the high- 
est quality factor chosen in each case where the same lightcurve had been found 
by more than one of the four runs. Since some lightcurves are only selected by 
one run and not by others, these combined figures would usually be expected to 
be larger than all of the individual runs. 

The "combined" results of the four PA/LT flux ratio regimes for the events se- 
lected by the "intermediate" cuts are presented in Tables 15.121 and 15.131 Included 
are the values of the parameters used to make the cuts and the resulting quality 
factor values. The table has been ordered by quality factor. Where the candidate 
was selected by more than one flux ratio run, the letters N,L,M,H have been used 
to indicate that it was found by the "No fixed flux ratio", "Low", "Medium" or 
"High" flux ratios respectively. Within each lightcurve where there are multiple 
flux ratio fits the fits were ordered and the one included in Tables 15.121 and 15.131 
chosen by the lowest Xgiob/d-°-f-- The values of PA/LT flux amplitude for the 
three fixed ratio runs were 7.89, 11.1 and 14.8 respectively. The type of fit per- 
formed, i.e. reduced Paczyhski or "mixed" is shown under "Fit". In cases where 
exactly the same fit was found by several of the runs (for example when only LT 



5.5. Results of Pipeline 



202 



data existed and hence the flux ratio with PA data was irrelevant), the above 
letters are separated using an "=". 'BSQF' represents the "bump sample quality 
factor" , and "LSN" represents the "lensing signal to noise" parameter. It will also 
be noticed that there is one instance in Table l5"T2l of two lightcurves with identi- 
cal parameters (161,7319). This was due to a duplication (or near duplication) of 
lightcurves which could occur in the DIA pipeline, as explained above in Section 
15.3.31 The combined figures for "normal" and "long period" events in Table 15.111 
have been corrected for this effect, and the number of independent objects are 
shown in the table (as opposed to lightcurves), with the number of duplicates 
removed shown in brackets, if any existed. The four candidates retained even 
with the strictest x 2 /d.o.f. cuts are highlighted with asterisks on the left hand 
side. ("CDR" again stands for "% 2 difference ratio") 

If the threshold for Cut 8 (lensing ratio) was raised to 3.0, which would be a 
more normal value to choose, being effectively a 3a detection as opposed to a 
2a one, then the number of selected candidates was reduced to 18. These are 
shown in Table |5~TH with their fitted values of ^fwhm- Clearly, as this threshold 
is raised, the number of "false" detections should decrease, but there were still 
several notable candidates within the larger list in Tables 15". 12l and l5.13l that might 
still be considered "good" candidates, despite their lower signal to noise, since 
they have very low x 2 values. Obviously one cannot take this process to its limit 
(lensing ratio— > 0, \ 2 ~~ >* 1) due to likely confusion with variable stars but the 
boundary is necessarily "fuzzy" . Two examples from Tables 15.121 and 15.131 which 
appeared by eye to at least be worthy of further consideration were lightcurves 
(161,7319) and 2335, and these are shown in Figures [5.311 and 15.321 below. (In all 
lightcurve plots below this point, blue points = LT data, red = FTN data and 
black = PA data). 
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Table 5.12: Table (part I) summarising the selection information of microlensing 
candidates selected for global x 2 /d.o.f.< 9.85, local x 2 /d.o.f.< 5.0, lensing ratio 
> 2.0 and the combined results of the four PA/LT flux ratio regimes, using the 
2007 photometry. 



LC 


OF 


R,un/s 


iter 


Fit 


V, 2 
Aloe 


LSN 


CDR 


v 2 . 

A glob 


BSQF 


in in 


3315 


6.063 


NHMt 

±y ^ J. J. ^ lv± ^ J_J 


1 


M 


2.398 


5.924 


0.808 


2.821 


1.477 


PA 


13776 


6.007 


N 


1 


M 


2.565 


2.321 


3.840 


1.860 


1.183 


PA 


6535 


3.543 


N H M L 


1 


M 


3.973 


6.686 


0.550 


2.603 


1.124 


PA 


2064 


2.595 


H,L 


1 


M 


3.306 


2.778 


1.144 


3.046 


1.384 


PA 


3282 


2.563 




1 


M 


3.738 


2.221 


1.739 


2.346 


1.282 


ANG 


8679 


2.510 


H,L 


1 


M 


3.799 


4.274 


0.188 


3.554 


1.818 


PA 


9086 


2.168 


M,L,H 


1 


M 


0.599 


2.091 


0.468 


2.102 


0.954 


PA 


7319 


2.061 


N 


1 


M 


2.749 


2.047 


1.074 


3.468 


1.509 


PA 


161 


2.061 


N 


1 


M 


2.749 


2.047 


1.074 


3.468 


1.509 


PA 


5947 


1.860 


N,L,H 


1 


M 


4.006 


2.163 


2.708 


3.633 


0.886 


PA 


7874 


1.841 


N,L,M,H 


1 


M 


4.767 


2.797 


1.820 


3.000 


0.906 


PA 


7201 


1.725 


M 


1 


M 


2.781 


2.114 


0.314 


3.176 


1.850 


PA 


762 


1.607 


H,M,N,L 


1 


M 


2.630 


3.098 


0.308 


4.185 


1.352 


PA 


9799 


1.536 


H,L 


1 


M 


0.209 


5.584 


0.544 


3.746 


0.352 


PA 


9920 


1.504 


L 


1 


M 


4.247 


3.693 


0.693 


7.192 


1.376 


PA 


16055 


1.456 


N 


1 


M 


4.578 


3.691 


0.842 


3.593 


0.875 


PA 


4853 


1.452 


N 


1 


M 


1.903 


2.510 


0.412 


1.806 


0.760 


ANG 


6056 


1.432 


N 


1 


M 


3.265 


2.165 


0.739 


4.282 


1.435 


PA 


1643 


1.418 


H 


1 


M 


4.387 


2.203 


0.661 


4.507 


1.723 


PA 


5791 


1.310 


L,N,M,H 


1 


M 


3.471 


2.570 


1.289 


4.486 


0.886 


PA 


9189 


1.278 


N 


1 


M 


2.910 


2.121 


0.493 


4.243 


1.443 


ANG 


1732 


1.252 


H 


1 


M 


4.760 


3.644 


0.184 


4.729 


1.377 


PA 


5383 


1.244 


L,N 


1 


M 


4.838 


2.607 


0.619 


3.193 


1.183 


PA 


2335 


1.221 


N 


1 


P 


1.713 


2.041 


0.775 


1.325 


0.512 


ANG 


5381 


1.199 


N,M,H 


1 


M 


4.119 


2.178 


0.191 


3.881 


1.850 


PA 
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Table 5.13: Table (part II) summarising the selection information of microlensing 
candidates selected for global x 2 /d-o.f.< 9.85, local x 2 /d.o.f.< 5.0, lensing ratio 
> 2.0 and the combined results of the four PA/LT flux ratio regimes, using the 
2007 photometry. 



LC 


QF 


Run/s 


iter 


Fit 


Xloc 


LSN 


CDR 


Xglob 


BSQF 


t in 


18106 


1.114 


H 


1 


M 


3.423 


2.033 


0.173 


3.725 


1.670 


PA 


7662 


1.026 


H 


1 


M 


3.652 


3.424 


0.396 


5.566 


0.989 


PA 


11057 


0.995 


N,L 


1 


M 


3.661 


2.085 


0.504 


3.934 


1.204 


PA 


6736 


0.990 


N 


1 


M 


4.470 


2.969 


0.574 


2.180 


0.704 


PA 


2075 


0.987 


H,M,L 


1 


M 


3.865 


2.602 


0.312 


6.354 


1.477 


PA 


2675 


0.978 


L 


2 


M 


4.085 


14.215 


1.491 


2.933 


0.097 


ANG 


1033 


0.976 


H 


2 


M 


3.349 


3.212 


0.382 


4.613 


0.875 


PA 


12648 


0.946 


L,M,H 


1 


M 


4.604 


2.881 


1.036 


2.862 


0.602 


PA 


4654 


0.905 


N 


1 


M 


4.842 


3.117 


0.643 


3.807 


0.764 


ANG 


3328 


0.894 


L,M 


1 


M 


3.877 


2.065 


0.933 


9.280 


1.473 


PA 


15888 


0.833 


M,H 


1 


M 


4.279 


2.558 


0.306 


6.793 


1.380 


PA 


13817 


0.704 


L=M=H 


1 


M 


3.717 


2.145 


0.889 


4.570 


0.720 


PA 


3911 


0.617 


M 


1 


M 


4.035 


2.481 


0.183 


6.048 


1.061 


PA 


7714 


0.616 


N 


1 


M 


1.789 


2.747 


1.042 


8.114 


0.544 


PA 


15294 


0.577 


N 


1 


M 


3.854 


3.174 


0.364 


6.304 


0.677 


PA 


13966 


0.573 


N 


1 


M 


3.406 


5.454 


1.188 


6.061 


0.227 


PA 


9092 


0.528 


N 


1 


M 


1.989 


3.303 


0.525 


5.402 


0.387 


PA 


6163 


0.491 


H 


1 


M 


2.673 


2.724 


0.172 


6.698 


0.720 


PA 


13318 


0.446 


H 


1 


M 


1.576 


2.290 


0.121 


6.705 


0.720 


PA 


7256 


0.389 


N,H,M 


1 


M 


2.656 


2.317 


0.431 


8.897 


0.677 


PA 


6354 


0.322 


N 


1 


M 


3.609 


3.300 


1.467 


7.864 


0.227 


PA 


16823 


0.309 


N 


1 


M 


3.078 


4.765 


0.862 


2.482 


0.097 


PA 


4286 


0.289 


L 


1 


M 


4.881 


7.786 


0.745 


4.222 


0.097 


PA 


1237 


0.157 


N 


1 


M 


3.705 


2.717 


0.251 


5.436 


0.211 


PA 


8239 


0.055 


N 


1 


P 


4.493 


2.049 


0.215 


4.257 


0.097 


PA 
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Figure 5.26: Histogram showing the variation in the angularly binned numbers 
of variable candidates, divided also into four radial bins, each of radial width 
50". The major division in azimuthal angle (horizontally), is 20 degrees, and in 
number, 50 objects.) 
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Figure 5.27: Histogram showing the variation in the angularly binned (with 32 
equal bins) numbers of variable candidates, divided also into four radial annuli, 
each of radial width 50". The major division in azimuthal angle (horizontally) is 
20 degrees, and in number, (vertically), 20 objects. 
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Figure 5.28: Histogram showing the radial distribution of variable star candidates 
in the inner 120" of the M31 bulge along with the best fit de Vaucouleurs profile. 
This plot used 220 circular annuli of equal area between r = 280" and r = 0. 
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Histogram of radial distribution of variable star candidates 
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Figure 5.29: Histogram showing the radial distribution of variable star candidates 
in the inner 50", focusing particularly on the inner 20". This plot used 1100 
circular annuli of equal area between r = 280" and r = 0. 
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Plot of approximate radial dependence of completeness 
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Figure 5.30: Plot of the radial dependence of the estimated spatial completeness 
function in the inner 50" of the bulge, assuming that the completeness is 1 at 
r = 120", and using a de Vaucouleurs profile fit to the real data for r > 23.065" 
and a linear fit inside that. 
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Figure 5.31: Plots of the lightcurve 161/7319 in the 2007 photometry. The panels 
are: 1) & 2) The original lightcurve in three bands with the full mixed fit. 3) The 
full lightcurve when the first iteration variable component has been subtracted 
and the data have been scaled and plotted on the same time axis. 4) A zoom in 
on the peak region of panel 3). 5) The residuals remaining after subtraction of 
the mixed fit shown in panels 1) & 2). 
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Table 5.14: Table summarising the information about the fitted peak of mi- 
crolensing candidates selected for global x 2 /d.o.f. < 9.85, local x 2 /d.o.f. < 5.0 
and the combined results of the four PA/LT flux ratio regimes, using the 2007 
photometry. 



LC 


QF 


Run/s 


iter 


Fit 


t in 


^FWHM 


3315 


6.063 


N,H,M,L 


1 


M 


PA 


82.018 


6535 


3.543 


N,H,M,L 


1 


M 


PA 


130.399 


8679 


2.510 


H,L 


1 


M 


PA 


388.182 


762 


1.607 


H,M,N,L 


1 


M 


PA 


84.963 


9799 


1.536 


H,L 


1 


M 


PA 


6.536 


9920 


1.504 


L 


1 


M 


PA 


40.901 


16055 


1.456 


N 


1 


M 


PA 


19.428 


1732 


1.252 


H 


1 


M 


PA 


76.637 


7662 


1.026 


H 


1 


M 


PA 


83.242 


2675 


0.978 


L 


2 


M 


LT 


2.881 


1033 


0.976 


H 


2 


M 


PA 


6.195 


4654 


0.905 


N 


1 


M 


LT 


24.966 


15294 


0.577 


N 


1 


M 


PA 


21.057 


13966 


0.573 


N 


1 


M 


PA 


2.958 


9092 


0.528 


N 


1 


M 


PA 


7.838 


6354 


0.322 


N 


1 


M 


PA 


3.081 


16823 


0.309 


N 


1 


M 


PA 


7.312 


4286 


0.289 


L 


1 


M 


PA 


8.917 



As can be seen from Tables 15.121 and 15.131 the large majority of the candidates 
selected were "Mixed" with t situated within the PA data. The next largest 
category was "Mixed,Angstrom" and there were only one each of (Pac,PA) and 
(Pac,Angstrom). The great majority of "mixed" events was not too surprising, 
given the degree of blending expected in the core of M31, but the size of the 
majority of candidates with t i n the PA data was surprising. A majority of 
some kind would have been predicted simply from the number of data points in 
each band, of which the PA data at the time had more, with a greater number of 
complete seasons (4) to the LT's two (+ the pilot season). Another reason could 
be a phenomenon noticed during the development of the selection routine, namely 
the prevalence of very short, almost vertical "spikes" within the PA data, which 
would naturally be attractors to a Paczyhski fit. It is not yet known whether these 
photometric phenomena are caused by physical events or due to some unusual 
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data processing problem, and this will require further investigation. 

The third and least pleasant explanation might be that the quality of the photom- 
etry from the version of the DIA pipeline used to generate the flux points used in 
this investigation was not as good as it might have been. From the investigation 
of the short high signal to noise event presented in this Chapter in Section I5.3[ 
and similar investigations into other high signal to noise variations highlighted 
by the APAS, certain phenomena such as apparent photometric oscillations after 
particularly bright flux points had been noted. Changes were made (among oth- 
ers) to the way the position of the objects were calculated in order to ensure that 
photometry was calculated closer to the position corresponding to the brightest 
fluxes (after these had occurred). All the improvements to the DIA pipeline were 
complete before the generation of the photometry for the 2008 candidate selection 
run, reported below. As later discovered (see Section I5.3.7p . given that a "per- 
fect" event in the 2008 photometry would have x 2 /d.o.f. = 1.96 on average, the 
sizes of the best % 2 /d.o.f. values in Tables 15.121 and 15.131 are about what would 
be expected, with only 3 out of the 50 selected events having x 2 /d.o.f. values less 
than 1.96. For illustration, lightcurves for the first five selected events in Table 
15.121 are presented below in Figures (15.331 to f5.37[) . 
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Figure 5.32: Plots of the lightcurve 2335 in the 2007 photometry. Panels are: 
1) The LT lightcurve (no PA or FT data existed for this lightcurve) with the 
reduced Paczyhski fit. 2) Zoom in on the peak region of 1). 3) Residuals after 
subtraction of the reduced Paczyhski fit. 
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Figure 5.33: Lightcurve of object number 3315 in the 2007 photometry, showing 
a) the original mixed fit and b) the peak region of the lensing component only, 
after the variable component has been subtracted. 
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Figure 5.34: Lightcurve of object number 13776 in the 2007 photometry, showing 
1) the original mixed fit and 2) the peak region of the lensing component only, 
after the variable component has been subtracted. 
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Figure 5.35: Lightcurve of object number 6535 in the 2007 photometry, showing 
a) the original mixed fit and b) the peak region of the lensing component only, 
after the variable component has been subtracted. 
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Figure 5.36: Lightcurve of object number 2064 in the 2007 photometry, showing 
a) the original mixed fit and b) the peak region of the lensing component only, 
after the variable component has been subtracted. 



5.5. Results of Pipeline 



218 



LT(i) JTH(l) Sealad fluiaa LC -jDlnl Seducad-Paoipiald/aksii-CDidiiuanM Fit 




880D S'tW 9S0O 

JE-215O00O.S (dupi) 



LT(i> SralBd fluxes LC 0D3SB2 -Joint RsiueBd-Paci^iiBM/51:ff»-SimiKJld Fit Iteration - 1 




3960 «ao 

JD-245O00O.5 (d*H 



Figure 5.37: Lightcurve of object number 3282 in the 2007 photometry, showing 
a) the original mixed fit and b) the peak region of the lensing component only, 
after the variable component has been subtracted. 
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Results using 2008 photometry run lightcurves 

When the latest version of the Angstrom DIA photometry pipeline was complete, 
new lightcurves were produced, and the selection pipeline re-run on these new 
data. Only LT and PA data were available initially, and so these were used for 
the analysis. The pipeline was also run on the LT data on their own, to enable 
the effect of adding in the PA data to be evaluated at a later date. 

Once the list of lightcurves which passed the "loose" cuts was known, the data 
were analysed to calculate the quality factors, as in Section 15.5.21 Also as in Sec- 
tion I5.5.2[ the resulting list was checked for groups of lightcurves with identical 
quality factors (and hence fitting parameters) or very similar t and £fwhm and 
hence similar quality factors. Both of these sorts of groups were treated as single 
objects for the rest of the analysis. Any pairs of objects for which the parameters 
were very similar, but not similar enough to be certainly from the same object 
had their physical X,Y coordinates checked to firmly identify whether they were 
different objects or not. When the data were grouped in this way, which cor- 
responded to an estimate of the numbers of independent astrophysical objects 
present in the sample, the numbers of clusters were considerably lower than the 
number of lightcurves, even after removal of all the identical sets. For example, 
at one point in the final development of the pipeline using the LT and PA run, 
using the "inter" cuts, there were initially 355 lightcurves selected, but it seems 
that these represent only about 177 actual objects. This seemed to be a startling 
rate of lightcurve degeneracy, being in this example roughly a 2-1 ratio. The 
same level of repeated lightcurves was not seen in the 2007 photometry, although 
they did occur. 

Overall, the total numbers of objects selected using only LT or both LT and PA 
data were found to be of comparable order but the average effect of using the 
PA data in addition to the LT data was to reduce significantly the number of 
lightcurves selected. The ratio of numbers in the three categories of x 2 /d.o.f. cut 
used above for the 2007 data, (i.e. "loose" , "inter" and "scaled") where sufficient 
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statistics existed, with and without PA data, was calculated to be a remarkably 
stable 0.36. That adding PA data reduced the numbers of selected lightcurves 
might have been predicted as the longer baselines this makes possible provide 
more opportunities for periodic variables to repeat, which otherwise might have 
been consistent with one of the two kinds of lensing fits performed in this work. 
Also, a greater chance exists for variable stars which do not quite fit the relatively 
simple model used here to exhibit variations which might not have occurred within 
the time range of the LT data alone, hence increasing the global x 2 /d.o.f. and 
perhaps excluding them from selection. Conversely, the addition of the PA data 
would have been expected to increase the number of detected variable stars, if 
the 2007 data were a reasonable guide. This has not yet been checked. 

Final selection of candidates 

After the analysis of the 2007 data, Cut 12 was added to the selection procedure. 
Using all 12 cuts, the "loose" cuts allowed through 145 lightcurves. After much 
experimentation with various cut levels, a set of cuts was decided upon that 
aimed to produce as good a set of candidates as possible without introducing 
any clearly insufficiently well fitted lightcurves. These cuts were: lensing ratio; 
LR > 3.0, x 2 difference ratio; CDR > 0.4, and the two x 2 /d.o.f. upper limits 
were set at Xg lob /d.o.f.< 7.0, xf ocal /d.o.f.< 5.0 respectively, which is equivalent 
to Xgiob/d-°-f-— 3-6, xfocai /d.o.f.< 2.6 for photon noise limited data. The values 
of LR and CDR were both set significantly more strictly than for the 2007 data 
set to cut down the level of false positives that the pipeline could allow through. 
The values of Xg lob /d.o.f. and xf ocal /d.o.f. were found originally empirically by 
experimentation, always maintaining the value of x^j/d.o.fX Xgiob/d-°-f-> but 
once the necessary factor of ~ 2 was known about these both also seemed to be 
sensible values. Using the above values, 38 lightcurves were selected, which proved 
to correspond to 20 physical objects, of which 2 were simple reduced Paczyhski 
fits. All of these were selected on the first iteration. The fitting parameters of 
these 20 lightcurves are presented in Tables 15.151 and 15.161 The parameters t 
and £fwhm which describe the shape of the lensing peaks are detailed in the two 
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Tables EH] and EH 

If the "short event" is included in this list (since with the addition of the Maidanak 
data and the re-scaling of the LT error bars it would have passed all the above 
cuts), then the total number of events would become 21. 

Spatial and timescale distributions 

By examining the information in Tables 15.171 and 15.181 it may be seen that the 
majority of candidates have event timescales (as measured by their £fwhm) less 
than 30 days. One way of binning the 20 selected events leads to the information 
contained in Table 15.191 While 2/3 of the candidates have £fwhm < 30 days, 
in actual fact 65% have ^fwhm < 20 days. However, in addition to this short 
duration rump is what appears to be a longer duration tail containing the other 
5 candidates. Number statistics are clearly too low to make any more detailed 
comments about the timescale distribution at this stage. 

In Figure 15.381 is shown the spatial distribution of the 20 selected candidates. 
Although this plot also obviously suffers from a lack of statistics, some qualita- 
tive observations can still be made. Firstly, there seems to be a possible "E-W" 
asymmetry in the numbers of candidates. For example, if the perpendicularly 
crossed lines which are oriented in the direction of the previously measured bulge 
orientation of 10° are taken as a reference, there are 14 candidates on the Western 
(right) side of the N-S line, but only 6 on the Eastern (left) side. However, only a 
small clockwise rotation of these lines would mean the count was 11/9, which is 
much less significant. Secondly, there appears to be a near-far asymmetry, with 
the nearer part of the galaxy towards the top right having fewer candidates, al- 
though the number densities would need to be worked out using the true available 
areas given the shape, orientation and overlaps of the Angstrom fields before this 
conclusion can be made firmly. 
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Table 5.15: Table (Part 1) summarising the selection information of microlensing 
candidates selected for global x 2 /d.o.f. < 7.0, local x 2 /d.o.f. < 5.0, lensing ratio 
> 3.0, x 2 difference ratio > 0.4 using the 2008 photometry. 



Obiect No 


LC 


OF 


iter 


Fit 


Y? 

Aloe 


LSN 


CDR 


y 2 , k 

A glob 


BSQF 


tc\ in 


1 


90314 


4.20 


1 


M 


2.27 


4.47 


1.89 


2.15 


0.72 


PA 




6614 


4.20 


1 


M 


2.27 


4.47 


1.89 


2.15 


0.72 


PA 




56125 


4.20 


1 


M 


2.27 


4.47 


1.89 


2.15 


0.72 


PA 




16819 


4.20 


1 


M 


2.27 


4 47 


1 89 


2.15 


72 


PA 


2 


38847 


2.94 


1 


M 


3.11 


3.46 


1.09 


4.19 


1.48 


PA 




87898 


2 67 


1 


M 


4 90 


6.21 


87 


4.92 


1 13 


LT 




49986 


2.67 


1 


M 


4.90 


6.21 


0.87 


4.92 


1.13 


LT 




29986 


2 67 


1 


M 


4 90 


6.21 


87 


4.92 


1 13 


LT 




28743 


2.67 


1 


M 


4.90 


6.21 


0.87 


4.92 


1.13 


LT 


4 


82883 


2.65 


1 


M 


3.24 


3.46 


0.46 


3.71 


1.81 


PA 


5 


83057 


2.30 


1 


M 


2.52 


4.33 


1.22 


3.35 


0.70 


PA 


6 


39599 


1.59 


1 


M 


1.96 


3.67 


0.51 


3.55 


0.79 


LT 




81226 


1.59 


1 


M 


1.96 


3.67 


0.51 


3.55 


0.79 


LT 




91968 


1.59 


1 


M 


1.96 


3.67 


0.51 


3.55 


0.79 


LT 


7 


8391 


1.49 


1 


M 


2.48 


6.26 


0.56 


3.01 


0.42 


PA 


8 


53608 


1.46 


1 


M 


4.27 


3.23 


0.41 


2.75 


1.12 


LT 


9 


26795 


1.29 


1 


M 


3.55 


6.71 


0.56 


3.73 


0.45 


PA 


10 


68058 


1.27 


1 


M 


3.69 


4.07 


0.53 


4.89 


0.88 


PA 




74112 


1.27 


1 


M 


3.70 


4.06 


0.53 


4.89 


0.88 


PA 
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Table 5.16: Table (Part 2) summarising the selection information of microlensing 
candidates selected for global x 2 /d.o.f. < 7.0, local x 2 /d.o.f. < 5.0, lensing ratio 
> 3.0, x 2 difference ratio > 0.4 using the 2008 photometry. 



Object No 


LC 


QF 


iter 


Fit 




LSN 


CDR 


Xglob 


BSQF 


t in 


11 


29137 


1.13 


1 


M 


0.64 


3.01 


0.41 


2.00 


0.35 


LT 


20 


60097 


1.10 


1 


M 


3.99 


4.22 


0.50 


2.87 


0.60 


PA 


21 


13139 


1.08 


1 


M 


2.71 


4.05 


0.72 


2.71 


0.42 


PA 




3957 


1.08 


1 


M 


2.39 


3.51 


0.81 


2.56 


0.42 


PA 




995 


1.07 


1 


M 


2.37 


3.48 


0.81 


2.56 


0.42 


PA 


22 


2616 


0.98 


1 


M 


3.55 


3.83 


0.43 


0.78 


0.39 


PA 




24441 


0.98 


1 


M 


3.55 


3.83 


0.43 


0.78 


0.39 


PA 


23 


42645 


0.95 


1 


M 


2.91 


5.83 


0.62 


3.38 


0.31 


PA 


24 


54978 


0.82 


1 


M 


4.38 


4.24 


0.66 


2.27 


0.39 


PA 


25 


77494 


0.59 


1 


P 


3.34 


3.62 


0.81 


3.59 


0.31 


LT 


26 


74650 


0.51 


1 


M 


0.40 


4.62 


1.89 


4.62 


0.10 


PA 




19644 


0.51 


1 


M 


0.40 


4.62 


1.89 


4.62 


0.10 


PA 


27 


81698 


0.46 


1 


M 


4.02 


4.67 


0.42 


5.00 


0.31 


PA 




28357 


0.46 


1 


M 


4.00 


4.69 


0.42 


5.04 


0.31 


PA 


28 


82746 


0.37 


1 


P 


1.76 


3.31 


0.43 


4.51 


0.24 


PA 
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Table 5.17: Table (Part I) summarising the information about the fitted peak of 
microlensing candidates selected for global x 2 /d.o.f. < 7.0, local x 2 /d.o.f. < 5.0, 
lensing ratio > 3.0, x 2 difference ratio > 0.4 using the 2008 photometry. 



Object No. 


LC 


QF 


iter 


Fit 


to in 


to 


^FWHM 


1 


90314 


4.20 


1 


M 


PA 


1918.19 


25.15 






A 9fl 


i 
i 


M 


PA 


iqic in 


9^ 1 x> 




^fil 9^ 


A 9fl 


i 
i 


M 


PA 




9^ 1 x> 




16819 


4.20 


1 


M 


PA 


1918.19 


25.15 


2 


38847 


2.94 


1 


M 


PA 


1813.39 


64.50 


3 


87898 


2.67 


1 


M 


LT 


4766.22 


78.20 




49986 


2.67 


1 


M 


LT 


4766.22 


78.20 




29986 


2.67 


1 


M 


LT 


4766.22 


78.20 




28743 


2.67 


1 


M 


LT 


4766.22 


78.20 


4 


82883 


2.65 


1 


M 


PA 


1984.58 


434.73 


5 


83057 


2.30 


1 


M 


PA 


2535.37 


45.52 


6 


39599 


1.59 


1 


M 


LT 


4792.65 


17.27 




81226 


1.59 


1 


M 


LT 


4792.65 


17.27 




91968 


1.59 


1 


M 


LT 


4792.65 


17.27 


7 


8391 


1.49 


1 


M 


PA 


2194.58 


10.70 


8 


53608 


1.46 


1 


M 


LT 


4779.34 


96.53 


9 


26795 


1.29 


1 


M 


PA 


2636.51 


16.51 


10 


68058 


1.27 


1 


M 


PA 


2142.92 


10.30 




74112 


1.27 


1 


M 


PA 


2142.93 


10.16 
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Table 5.18: Table (Part II) summarising the information about the fitted peak of 
microlensing candidates selected for global x 2 /d.o.f. < 7.0, local x 2 /d.o.f. < 5.0, 
lensing ratio > 3.0, x 2 difference ratio > 0.4 using the 2008 photometry. 



Object No. 


LC 


QF 


iter 


Fit 


t in 


to 


^FWHM 


11 


29137 


1.13 


1 


M 


LT 


4466.33 


25.12 


12 


60097 


1.10 


1 


M 


PA 


1911.02 


14.90 


13 


13139 


1.08 


1 


M 


PA 


2195.60 


9.43 




3957 


1.08 


1 


M 


PA 


2195.59 


9.10 




995 


1.07 


1 


M 


PA 


2195.59 


9.07 


14 


2616 


0.98 


1 


M 


PA 


1397.83 


6.38 




24441 


0.98 


1 


M 


PA 


1397.83 


6.38 


15 


42645 


0.95 


1 


M 


PA 


1524.50 


13.09 


16 


54978 


0.82 


1 


M 


PA 


1396.19 


4.30 


17 


77494 


0.59 


1 


P 


LT 


3293.69 


18.92 


18 


74650 


0.51 


1 


M 


PA 


1912.95 


6.18 




19644 


0.51 


1 


M 


PA 


1912.95 


6.18 


19 


81698 


0.46 


1 


M 


PA 


1525.44 


8.65 




28357 


0.46 


1 


M 


PA 


1525.54 


9.25 


20 


82746 


0.37 


1 


P 


PA 


2224.60 


3.08 



tFWHM(days) 


Number of lightcurves 


0-15 


10 


15-30 


5 


30-45 





45-60 


1 


60-75 


1 


75-90 


1 


90 - 105 


1 


105+ 


1 



Table 5.19: Table giving a coarsely-binned histogram of the timescales of the 20 
selected candidates from the 2008 photometry 
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Scatterplot of ensing candidate positions. 
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Figure 5.38: Scatter plot showing the spatial distribution of the 20 selected lensing 
candidates. The dark diagonal line shows the orientation of the M31 disk, and 
the lighter lines are at 10° and 100° from the disk direction 

Candidate lightcurves 

Plots of the other 20 selected events are presented below in Figures 15.391 to I5.58[ 
both of the whole lightcurve with LT and/or PA data and of the Paczyhski peak 
part of the data (with the variable fit part subtracted in the case of mixed fits), 
plus the residuals to the chosen fits. Out of the groups of lightcurves with similar 
fit parameters, physical coordinates and quality factors, the lightcurves chosen 
for the plots are either the lightcurve with the highest quality factor, or, if the 
lightcurves and hence quality factors were truly identical, the lightcurve with the 
lowest lightcurve number in the group. 
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MUD Scaled fluiaa LC 01749* -HBdured-Parcjiijld Fit 



ttaratLan — 1 




mm ana 



amn iboo tnao 

JE-2i5O00O.S (djsjn) 



LT[1) Scaled fluiaa LC 01749* -BBduced-FacEjiuld Fit 



Iteration — 1 
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Figure 5.39: Lightcurve of object number 77494 in the 2008 photometry, showing 
Top) the original Paczyhski fit, Middle) the central peak region and Bottom) the 
residuals after subtraction of the fit. 
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LT(1> JtJ&tl) Scaled fhrraa LC (KJET^a -RBdured-Pfl.ciynfllai Fit ttaratlon - E 





Ij 1 


1 1 1 1 1 1 1 1 1 1 


1 1 1 1 


fj 
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— 1 
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1 
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Figure 5.40: Lightcurve of object number 82746 in the 2008 photometry, showing 
Top) the original Paczyhski fit, Middle) the central peak region and Bottom) the 
residuals after subtraction of the fit. 
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Mixed Fit Objects 
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LT(i} ,PAB(1) Scalad ninaa LC DMA 14 -taint I^UGad-P&csynskl/akeir-CoBinuanld Fit Iteration - 1 




awn awn 

JE-2i5O00O.S (d»r») 



LT(i} ,PaB(i) Scalad flnsaa LC DMA 14 -taint I^UGad-P&csynskl/akeir-CoBinuanld Fit Iteration - 1 




IBM iWD 1WS0 i»4D 

JE-2i5O00O.S (d»r») 



LT(i) ,P43(1) Scalad fluiaa LC DM914 -Joint Mus8d-Pac2jm^/ake«--Ciistauaold [It Iteration - S 
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JD-Z45O00O.5 (dojn) 



Figure 5.41: Lightcurve of object number 6614 in the 2008 photometry, showing 
Top) the original mixed fit and Middle) the peak region of the lensing component 
only, after the variable component has been subtracted and Bottom) the residuals 
after subtraction of the mixed fit. 
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LT(i} ,PAB(1) Scalad ninaa LC D30H47 -taint E*duaad-Pac3pnald/SQEeTr-CDBdiiuanld Fit Iteration - 1 




awn awn 

JTJ-245O00O.5 (d»r») 



LT(i} ,PaB(i) Scalad ninaa LC D30H47 -taint I^UGad-P&csynskl/akeir-CoBinuanld Fit Iteration - 1 




1SD0 IffiD lfOD 17B0 



1B0D 19B0 
JE-215O00O.S (d»r») 



IBM 1950 



LT(i) ,PA3(i) Scolad nuiaa LC D3BB47 -Joint I^ucad-P&csjmsld/SQreir-Cosinuanld Fit Iteration - a 



r 



5500 a(M)D 3WD 

JTJ-245O00O.5 (&•;■) 



Figure 5.42: Lightcurve of object number 38847 in the 2008 photometry, showing 
Top) the original mixed fit and Middle) the peak region of the lensing component 
only, after the variable component has been subtracted and Bottom) the residuals 
after subtraction of the mixed fit. 
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LT{0 ScaJad fluiHS LC 0BB?4a -Joint Eadncad-PacsyTiald/Skflw-ajriimanld Pit Iteration - 1 




3500 H4D0 



aeon 4dw «oo 
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8r> 




4BW 4B60 4800 4850 
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Figure 5.43: Lightcurve of object number 28743 in the 2008 photometry, showing 
Top) the original mixed fit and Middle) the peak region of the lensing component 
only, after the variable component has been subtracted and Bottom) the residuals 
after subtraction of the mixed fit. 
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LT(i} ,PAB(1) Scalad ninaa LC DBSBea -taint I^UGad-P&csynskl/akeir-CoBinuanld Fit Iteration - 1 




5W0 3MD 36QD 

JE-215O00O.S (d»r») 



LT(i} ,PaB(i) Scalad nusaa LC DBSBea -taint I^UGad-P&csynskl/akeir-CoBinuanld Fit Iteration - 1 




1500 sooc 
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Figure 5.44: Lightcurve of object number 82883 in the 2008 photometry, showing 
Top) the original mixed fit and Middle) the peak region of the lensing component 
only, after the variable component has been subtracted and Bottom) the residuals 
after subtraction of the mixed fit. 
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PA3(i) Scaled Duxes 1C 0eaO5? -Joini Reduced-Pac^nsfci/akew-Cnsinufloid Pit Iteration - 1 




14D0 IBM 1600 LIM 18D0 180D BflDO 3100 8E0O HMD £400 EMd 8SM STO 

JB-215O00O.5 (dijni) 



Pa3(1) Scaled Duxes 1C 0ea05? -Joint Reduced-Pac^nsfci/akew-Cnsinufloid Pit Iteration - 1 




i 1 1 — 1 1 i 



SBM 7.SSH 



PA3(i) Scaled flaxes 1C OeaaoT -Joint Reduced-P&csynsfcl/Sfcew-Cnsinuflold Pit Iteration - 8 




i i i ■ ■ ■ 



1300 14D0 IBM 1600 HOD 18D0 1MB BflBO 3100 HSOO E30B ?A0O 7,50(1 MOO 

JD-215O00O.5 (d»ya) 



Figure 5.45: Lightcurve of object number 83057 in the 2008 photometry, showing 
Top) the original mixed fit and Middle) the peak region of the lensing component 
only, after the variable component has been subtracted and Bottom) the residuals 
after subtraction of the mixed fit. 
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LT{0 ScaJad fluiHS LC 0B9599 -Joint Iteducad-PacsyiiHld/Skflir-CoBiimanLd Pit Iteration - 1 
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Figure 5.46: Lightcurve of object number 39599 in the 2008 photometry, showing 
Top) the original mixed fit and Middle) the peak region of the lensing component 
only, after the variable component has been subtracted and Bottom) the residuals 
after subtraction of the mixed fit. 
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LT{1) ScaJad Huibs LC 063000 -Joint Eaducad-PacsjuakVSkB^-CoBinuanLd. pit Iteraiion - 1 

— i 1 1 1 1 1 1 1 1 1 1 1 1 1 r 
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Figure 5.47: Lightcurve of object number 53608 in the 2008 photometry, showing 
Top) the original mixed fit and Middle) the peak region of the lensing component 
only, after the variable component has been subtracted and Bottom) the residuals 
after subtraction of the mixed fit. 
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LT(i} ,FaB(1) Sc-alad flusaa LC DESB795 -Saint Eeduaad-Pacspnald/akeTr-CDBdiiuanld Fit Iteration - 1 




1500 HODO E50Q- 3000 3500 40QD 45GD 

JTJ-245O0M.5 (dajv) 



LT(i} ,FaB(1) Sc-alad flusaa LC DESBV95 -Joint R«lu i:ad-P&c 3 yiiald/'3ke-ir-Cc, shiua aid Iteration - 1 



>n _ 




o - 



SBSD SSia SSB0 8030 13640 SS5D S890 »B7Q 8MQ 

JTJ-245O0M.5 (dajv) 



LT(i) ,PAB(1) Sc-alad fluias LC DBB795 -Joint I^ucad-Pacijmaki/aks^Cosdnusnld [It Iteration - a 











I [ 


■ i ' 


1 1 




















T 


1 1 

i 













1509 BOOa KSDO 800C SBflD 40OD 4BOD SDGD 

JD-Z45O00O.5 (dojn) 



Figure 5.48: Lightcurve of object number 26795 in the 2008 photometry, showing 
Top) the original mixed fit and Middle) the peak region of the lensing component 
only, after the variable component has been subtracted and Bottom) the residuals 
after subtraction of the mixed fit. 
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Figure 5.49: Lightcurve of object number 68058 in the 2008 photometry, showing 
Top) the original mixed fit and Middle) the peak region of the lensing component 
only, after the variable component has been subtracted and Bottom) the residuals 
after subtraction of the mixed fit. 



5.5. Results of Pipeline 



239 
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Figure 5.50: Lightcurve of object number 8391 in the 2008 photometry, showing 
Top) the original mixed fit and Middle) the peak region of the lensing component 
only, after the variable component has been subtracted and Bottom) the residuals 
after subtraction of the mixed fit. 
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Figure 5.51: Lightcurve of object number 29137 in the 2008 photometry, showing 
Top) the original mixed fit and Middle) the peak region of the lensing component 
only, after the variable component has been subtracted and Bottom) the residuals 
after subtraction of the mixed fit. 
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Figure 5.52: Lightcurve of object number 60097 in the 2008 photometry, showing 
Top) the original mixed fit and Middle) the peak region of the lensing component 
only, after the variable component has been subtracted and Bottom) the residuals 
after subtraction of the mixed fit. 
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Figure 5.53: Lightcurve of object number 13139 in the 2008 photometry, showing 
Top) the original mixed fit and Middle) the peak region of the lensing component 
only, after the variable component has been subtracted and Bottom) the residuals 
after subtraction of the mixed fit. 
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Figure 5.54: Lightcurve of object number 2616 in the 2008 photometry, showing 
Top) the original mixed fit and Middle) the peak region of the lensing component 
only, after the variable component has been subtracted and Bottom) the residuals 
after subtraction of the mixed fit. 
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Figure 5.55: Lightcurve of object number 42645 in the 2008 photometry, showing 
Top) the original mixed fit and Middle) the peak region of the lensing component 
only, after the variable component has been subtracted and Bottom) the residuals 
after subtraction of the mixed fit. 
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Figure 5.56: Lightcurve of object number 54978 in the 2008 photometry, showing 
Top) the original mixed fit and Middle) the peak region of the lensing component 
only, after the variable component has been subtracted and Bottom) the residuals 
after subtraction of the mixed fit. 
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Figure 5.57: Lightcurve of object number 74650 in the 2008 photometry, showing 
Top) the original mixed fit and Middle) the peak region of the lensing component 
only, after the variable component has been subtracted and Bottom) the residuals 
after subtraction of the mixed fit. 
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Figure 5.58: Lightcurve of object number 81698 in the 2008 photometry, showing 
Top) the original mixed fit and Middle) the peak region of the lensing component 
only, after the variable component has been subtracted and Bottom) the residuals 
after subtraction of the mixed fit. 
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The candidate event lightcurve 39599 is clearly still in progress when the 2008 
data end. This event has also been independently selected as an interesting 
transient by the Angstrom Aler t System (AAS), was the subject of an alert issued 



by Angstrom via iKerind (120081 ). and has been designated ANG-08B-M31-05. To 
confirm whether the continuation of the data is consistent with lensing, or is more 
consistent with being a nova, the data which have been collected since November 
2008 were added to the lightcurve. 

This plot shows the flux generally falling, as expected, but the cause of the spike 
is still ambiguous to the eye when the variable background amplitude is taken 
into account. Therefore, the selection pipeline was used to re- fit the extended 
lightcurve with a full mixed fit. A good fit to the lightcurve was still obtainable 
as measured by the local and global x 2 / d.o.f., but in order to better fit to the peak 
area, the fit to the variable baseline was clearly degraded noticeably by using a 
less appropriate period. Also, the Lensing Ratio of the lensing event was reduced 
to a point where it was no longer selected according to Cut 8. Because of this, Cut 
12 was not even tested, but it is suspected that this cut might also have failed due 
to the coherently degraded fit to the baseline. Since the original joint fit to the 
un-extended lightcurve of 39599 had a better fit to the baseline, the variable part 
of the original joint fit was subtracted from the extended lightcurve, and then 
a reduced Paczyhski fit was attempted. The result of this is given in the lower 
panel of Figure I5.59[ along with the mixed fit that the selection pipeline would 
normally have fitted (in the upper panel). It can be seen in the lower panel of 
Figure 15". 591 that although the baseline is substantially flatter than in the original 
lightcurve, residual variations still remain, and in the peak area, the data do seem 
to undershoot a Paczyhski fit on the rise, and overshoot it on the fall, which might 
be consistent with a nova lightcurve. This is a fairly subtle effect, however, not 
much more than la, and might be due to the residual variations in the baseline. 
It seems likely that it is these remaining "wiggles" in the peak that the joint fit 
was attempting to reduce by sacrificing some of the baseline fit. The fit to the 
Paczyhski model is still quite good, however. This lightcurve is presently judged 
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to be a close call between a nova or microlensing, and at the time of submission is 
still an on-going event. Perhaps the further extension of the lightcurve with time 
will resolve the issue one way or the other. This event does illustrate some of the 
difficulties that can be caused when the baseline cannot be modelled simply by 
a one component model. Attempting to use a model that is too simple for the 
behaviour of the data can cause good events to be rejected. There may be a case 
for attempting to remove all variability in the baseline independently of the fitting 
of the proposed microlensing peak. This method would be computationally more 
difficult, however, and might introduce problems of its own, for example, in the 
case when the microlensing timescale was greater than the period of the variable. 



5.6 Testing the pipeline with fake lightcurves 

A comprehensive Monte Carlo test of the pipeline, which would be necessary to 
calculate the pipeline efficiency, and hence allow a definitive estimation of the dis- 
tribution of matter in our field of M31 would require relatively complex modelling 
of quantities such as spatial, mass, velocity and luminosity distributions, and by 
hence by implication the anticipated distributions of lensing timescales and am- 
plitudes. To be a realistic model, it would also need to include variable stars, 
and so these would either need to be generated artificially, including their period, 
skewness, amplitude, and spatial distribution, or the fake lensing events seeded 
into the original variable object lightcurves. The extent of this kind of exercise is 
beyond the possible scope of this work, but it is nonetheless felt that testing the 
pipeline in some sort of simplified way, using fake lightcurves, is a useful exercise. 
This testing is reported below. The results gained, due to their highly simplistic 
nature, cannot be taken as very quantitative, but do give a qualitative impression 
of the likely performance of the pipeline in semi-realistic situations. 
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5.6.1 Selecting a baseline lightcurve 

A totally flat baseline might have been chosen to add the simulated microlensing 
event peaks to, but it is felt that a more stringent and realistic test is to add 
them to a real lightcurve from the variable object database. However, to keep the 
test reasonably simple so that it would be expected that a significant fraction of 
lightcurves would pass the test, the lightcurve selected for the baseline was chosen 
to be one which had previously been identified by the pipeline as a "constant" 
lightcurve. In other words, a constant fit satisfied the criterion for the x 2 °f 
the fit. In addition, either no significant coherent bumps were found in the 
lightcurve, and thus no other fitting functions were fitted, or a bump or bumps 
were present, but the fitted x 2 ' s did not improve on the constant fit value. In 
addition, it is desired that both LT and PA data exist, and that the number of 
data points in both LT and PA are close to their maximum possible values. The 
first 500 lightcurves were re-run using the current pipeline, (not fixing the fitted 
flux amplitude ratio of LT and PA data). Of these lightcurves, only 8 were both 
classified as "constant" as described above and have both LT and PA data. Two 
of these lightcurves had global \ 2 values which were almost equally the lowest 
of the 8, but one has clearly more data points. This lightcurve, number 211, 
is therefore chosen as the baseline for this exercise. It has a global \ 2 value of 
3.95, (remembering that this should be divided by a factor of approximately 2 
to allow comparison with the expected \ 2 value for a truly constant lightcurve, 
~ 1). The number of data points is 371 in LT and 282 in PA. The lightcurve of 
variable object 211 is shown in Figure 15". 601 To allow easier plotting and insertion 
of fake events, both the flux and error values of the PA data set are divided by a 
scaling factor of 10 which, as shown in Section |4.6.1[ is approximately the average 
ratio between the PA and LT data. This process requires that the assumption be 
satisfied that the variations in PA flux are symmetrical on average around zero 
flux, which is confirmed by inspection of Figure 15.601 Inspection of this plot also 
shows that despite its classification by the pipeline (which may only signify that 
this lightcurve is not fitted very well by a simple skew-sinusoid model), lightcurve 
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211 is clearly not very constant. Significant and coherent variations can be seen, 
especially in the final season of PA data and the first season of LT data. These 
are reminiscent of the kind of flux variations seen in semi-regular variable stars, 
but might also be the sum of the flux variations of several variable stars. Such 
a variable baseline was not quite the expected character of the light curve, but it 
is not thought that the variations are so extreme as to preclude its use as the 
baseline for inserting the fake events. The effect on the fake lightcurve test is to 
make it harder for the pipeline to find candidates. 

Inserting fake Paczynski events 

After inspection of lightcurve 211, and the observation that the variations in that 
lightcurve were generally of the order of 10 ADU/s, it was decided to create fake 
events with Paczynski peak fluxes ranging from 10 ADU/s to 90 ADU/s with 
specific peak flux values of (10, 30, 50, 70, 90) ADU/s. For reference the peak 
fitted flux of the short event is about 400 ADU/s, so it is known that this range 
lies within the range of possible parameters. No additional Poisson noise is added 
due the extra flux. Based on the average lensing timescale in the Milky Way being 
of order of a month, lightcurves were created with lensing timescales £fwhm = (5, 
50, 500) days. The locations in time of the lightcurves were randomised in the 
range 51392.0 < t < 54796.0 (JD-2400000.5). Ten lightcurves were generated for 
each set of fake event parameters in order to smooth out the random variations 
in event location a little. The total number of fake lightcurves generated, then, 
was 10 x 3 x 5 = 150. It should be noted that this randomisation means that the 
expected pipeline inefficiency factor due to data only being taken for a fraction of 
the year has been naturally included in the overall efficiency This "data coverage 
factor" was estimated for this particular lightcurve by working out the fraction 
of the above range covered by seasons of data. The fraction of the total time 
between the beginning and end of the data covered by data taking was found 
to be 0.384. As a check, the fraction of created fake events where the to value 
was within the ranges defined by the seasons of data was also calculated, and 
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Table 5.20: Table giving the original parameters for the 13 fake events successfully 
found by the candidate selection pipeline. 



Fake 


to 


^fwhm 


Peak Flux 


to 


Lightcurve 


(JD-2400000.5) 


(days) 


(ADU/s) 


in-season? 


93371 


2232.92 


50 


30 


YES 


93435 


1915.26 


50 


70 


NO 


93438 


1504.51 


50 


70 


YES 


93443 


3279.80 


500 


70 


YES 


93444 


4424.07 


500 


70 


YES 


93447 


1935.85 


500 


70 


NO 


93448 


2116.21 


500 


70 


NO 


93463 


1903.74 


50 


90 


YES 


93468 


1894.20 


50 


90 


YES 


93471 


3400.73 


500 


90 


NO 


93472 


4644.21 


500 


90 


NO 


93475 


4739.17 


500 


90 


YES 


93477 


1874.75 


500 


90 


YES 



found to be 0.36. These numbers are not strictly the exact efficiency factors due 
to the small irregular gaps in data coverage within each season, and because it 
is allowed and possible for the pipeline to find events with to out of season as 
long as the specifications of Cut 6 (see Section |4T3~|) are followed. Also, the above 
figure is an overestimate, as the total timespan used both starts and ends with a 
season of data, thus including one fewer "off season" than should be included. It 
is however consistent with the time range of the fake events themselves. It should 
be remembered that even with no "noise" on the baseline, the pipeline would not 
be able to find much more than the above fraction of events, depending on how 
many events of each timescale were included. 



Results of fake lightcurve fitting 



The lightcurve numbers quoted in Tables 15. 20} I5.22[ I5.23[ and 15.241 are a nominal 
numbering system starting after the number of the final "real" lightcurve, which 
was 93240. 
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For each event found by the pipeline, it was calculated whether the original to falls 
inside or outside the ranges of the data. These determinations are also recorded 
in the last column of Table 15.201 

A benchmark figure for comparison 

In order to put the above figures into some kind of context and shed some light 
on the causes of the overall efficiency figure in the above experiment, a simpler 
test is also conducted to provide a baseline figure with which the number above 
may be compared. An identical 150 fake lensing events as before are generated, 
except in this experiment the baseline of the lightcurve is not a realistic variable 
lightcurve but is completely flat. To provide some statistical realism for the fitting 
routine, a "jitter" is added to the flux points corresponding to alternately raising 
and lowering the Paczyhski peak flux values by an amount equal to the size of 
the actual one a error bar (of the realistic data of lightcurve 211). 

When this test is carried out, 31 of the 150 lightcurves are passed by the selection 
pipeline as having a detectable event in, of which 5 are classed as "Paczyhski" 
events rather than "Mixed" events. Examination of the fitted lightcurves shows 
that the correct peaks were fitted in those events detected. Two of the 5 "Paczyhski" 
fits have timescales of 5 days, which are the only fake events recovered with the 
shortest of the three timescales, and the other three are 50 days. As with the real- 
istic baseline data set, no events were detected with peak fluxes below 30 ADU/s. 
The above detection rate represents a percentage of 20.7%, which is comparable 
to the ~ 38% data coverage fraction. There is no reason to expect a precise cor- 
respondence to this figure, as the actual number found will depend strongly on 
both the timescales and peak fluxes chosen, which are clearly arbitrarily chosen in 
this case. By comparing the results of this test with the test using lightcurve 211 
as the baseline, the effect of the variability of the realistic lightcurve used in the 
above test can be separated out. The deterioration in detection efficiency between 
these experiments caused by using a "realistically varying" baseline lightcurve is 
therefore 12%, corresponding to multiplying the "flat baseline" efficiency by a 
factor 0.42. Therefore the number of detected events is reduced by 58% by using 
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Table 5.21: Table giving the numbers of events found and some statistics for 
comparison between the two baselines used, for the events successfully found by 
the candidate selection pipeline, out of the original 150 fake events. 



Baseline 


Number 


Fraction 


Number found with 


Fraction IN 


Fraction 


used 


found 


of total 


^fwhm (5,50,500) days 


data seasons 


IN LT data 


"Flat" 


31 


20.7% 


(2,11,18) 


61.5% 


58.1% 


"LC 211" 


13 


8.7% 


(0,5,8) 


67.7% 


38.4% 



this particular "realistic" baseline. 

Table 15.211 gives the comparison of the statistics of detected events between the 
two baselines used. Given that there is an almost whole season more of LT data 
than PA, the fact that the percentage of events found within the LT data for the 
LC211 baseline is less than 50%, whereas using the flat baseline it is much more 
proportional to the relative quantities of data may imply that it is harder for the 
pipeline to find events in the LT due to the statistical noise being greater. This 
seems to be consistent with the events found in the totality of lightcurves, where 
fewer events were also found in the LT than PA data. The percentage of events 
which are found with to within a season of data is higher in both cases than the 
fraction of the total timespan taken up by the seasons of data themselves. This 
is expected, as it is much less likely for the specifications of Cut 6 to be satisfied 
if t is out of season. The preponderance of longer timescale events found may 
also be explained by their relatively more even data coverage compared to their 
timescale, and also that there will on average be greater numbers of data points 
within the peak to aid the fitting routine in finding it. 

Recovery of fake event microlensing parameters 

The fitted parameters of the 13 events passed by the candidate selection pipeline 
were investigated to see how well the original parameters had been recovered. 
The fitted parameters and the comparison to the original quantities are sum- 
marised in Tables I5.22[ I5.23[ 15.241 and 15.251 for the parameters -Blt, to, £fwhm 
and AFlt respectively. On the whole, most of the original parameters are recov- 
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ered quite well. The baseline flux parameter, which is set equal to zero for the 
fake lightcurves is generally found to be of the order of 2 ADU/s, compared to 
the magnitudes of the input flux peaks which vary from 30 to 90 ADU/s. The 
average percentage magnitude of error in the fitted baselines, compared to the 
AF values, is therefore 3.0%. The fitted values of to are also well recovered, 
being on average only 3.3% of the £fwhm values in error. The peak flux values 
are also recovered well, with the mean error being 6.4%. The worst recovered 
of the four fitted microlensing parameters are the values of £fwhm- The original 
values of this parameter are only recovered to within 21% on average. A clear 
tendency can also be seen among the fitted values of £fwhm to underestimate 
the original parameters. 11 out of 13 events have £fwhm less than the original 
value. One possible theory to explain this is that sometimes a better fit can be 
obtained to a peak in the data by combining the peak of the variable part with 
the Paczyhski part of a mixed fit, which means that some of the "work" is done 
by the variable part, leaving an inaccurate estimate of the Paczyhski parameters. 
However, given the clear variations in the lightcurve, a worst parameter error of 
21% and all others in single figures is not considered too bad. Another important 
point to consider is that the pipeline is tuned quite heavily towards only allowing 
through more reliable candidates, even at the expense of rejecting events that 
may be quite impressive, but have an element of doubt attached to them for one 
reason or another. Therefore, it is good to observe that no "false positives" were 
found, even when the Paczyhski peak amplitude was as low as 10 ADU/s. It must 
be remembered, however, that there may be a bias against this inherent in the 
method used, as the baseline lightcurve was specifically selected as one that had 
previously been found not to have any flux deviations that were consistent with 
a microlensing explanation and significant enough to be found by the pipeline. 
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Table 5.22: Table comparing the original and fitted baseline fluxes for the 13 fake 
events successfully found by the candidate selection pipeline. 



rake 


Baseline Flux 


Baseline 


group mean 


group mean 


group mean 


T * 1 J_ 

Lightcurve 


(original) 


Flux (fitted) 


of A B.Flux 


of |AB.Flux| 


c 1 AB Fluxl /ft-/ \ 

of 1 AF ( % ) 


93371 





-1.3 ± 0.2 


-1.32 


1.32 


4.4 


93435 





-1.9 ± 0.2 


-0.54 


1.31 


1.9 


f\ O A (JO 

93438 





0.8 ± 0.3 








no A A o 


U 


0.2 ± U.8 








93444 





-3.8 ±0.6 


±0.98 


2.88 


4.1 


93447 





1.2 ±0.4 








93448 





1.3 ±0.4 








93463 





-2.0 ±0.2 


-2.06 


2.06 


2.3 


93468 





-2.1 ±0.2 








93471 





3.2 ±0.2 








93472 





-3.6 ±0.3 


0.41 


2.21 


2.5 


93475 





1.1 ±0.2 








93477 





0.9 ±0.5 












overall means 


±0.074 


2.18 


3.0 



Table 5.23: Table comparing the original and fitted to parameters for the 13 fake 
events successfully found by the candidate selection pipeline. 



Fake 


t (original) 


to 


tittcd— original 
<FWHM 


group mean of 


Lightcurve 


(JD-2400000.5) 


(fitted) 


fractional ratio (%) 


93371 


2232.92 


2228.3 ±0.3 


-0.093 


9.3 


93435 


1915.26 


1910.8 ±0.7 


-0.089 


6.8 


93438 


1504.51 


1506.9 ±0.8 


±0.048 




93443 


3279.80 


3325 ± 4 


±0.090 




93444 


4424.07 


4428 ± 4 


±0.0071 


2.5 


93447 


1935.85 


1934 ±2 


-0.0029 




93448 


2116.21 


2116 ± 2 


-0.0012 




93463 


1903.74 


1903.0 ±0.3 


-0.015 


1.0 


93468 


1894.20 


1894.5 ±0.2 


±0.0054 




93471 


3400.73 


3416 ±3 


±0.031 




93472 


4644.21 


4660 ± 3 


±0.031 


1.9 


93475 


4739.17 


4745 ± 2 


±0.011 




93477 


1874.75 


1872 ± 1 


-0.0049 










overall mean (%) 


3.3 
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Table 5.24: Table comparing the original and fitted £fwhm parameters for the 13 
fake events successfully found by the candidate selection pipeline. 



T~ 1 1 

Fake 


^FWHM 


^FWHM 


C i 1 

tract, change 


group mean of 


Lightcurve 


(orig.) (days) 


(fitted) (days) 


fitted— original 
original 


tract. change (%) 


93371 


50 


26 ±2 


-0.49 


49 


93435 


50 


43 ± 1 


-0.15 


15 


93438 


50 


42 ±3 


-0.16 




93443 


500 


240 ± 10 


-0.52 




93444 


500 


567 ± 3 


+0.13 


23 


93447 


500 


430 ± 10 


-0.14 




93448 


500 


440 ± 10 


-0.11 




93463 


50 


44 ±1 


-0.11 


9.6 


93468 


50 


46.1 ±0.9 


-0.078 




93471 


500 


290 ± 20 


-0.43 




93472 


500 


540 ± 20 


+0.082 


22 


93475 


500 


367 ±6 


-0.27 




93477 


500 


4453 ± 8 


-0.094 










overall mean (%) 


21 



Table 5.25: Table comparing the original and fitted Peak Flux parameters for the 
13 fake events successfully found by the candidate selection pipeline. 



Fake 


Peak Flux 


Peak Flux 


fract. change 


group mean 


Lightcurve 


(orig.) (ADU/s) 


(fitted) (ADU/s) 


fitted— original 
original 


tract. change (%) 


93371 


30 


31.8 + 0.8 


+0.060 


6.0 


93435 


70 


69+1 


-0.013 


2.4 


93438 


70 


68 + 3 


-0.034 




93443 


70 


66 + 2 


-0.059 




93444 


70 


72.2 + 0.5 


+0.031 


6.1 


93447 


70 


65 + 1 


-0.071 




93448 


70 


64.0 + 0.9 


-0.085 




93463 


90 


95 + 1 


+0.058 


5.4 


93468 


90 


95 + 1 


+0.050 




93471 


90 


106 + 4 


+0.18 




93472 


90 


95 + 2 


+0.054 


9.3 


93475 


90 


96.5 + 0.6 


+0.072 




93477 


90 


84.3 + 0.8 


-0.063 










overall mean (%) 


6.4 
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Figure 5.59: The extended lightcurve of object number 39599 in the 2008 pho- 
tometry, fitted in the top panel with a joint fit, and in the bottom panel with a 
reduced Paczyhski fit, after the variable part of the fit to Figure 15.461 has been 
subtracted. 
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Figure 5.60: Lightcurve of object number 211 in the 2008 photometry, which is 
used as a baseline for the fake lightcurve test of the pipeline. 



Chapter 6 

Discussion and forward look 



6.1 Introduction 



In this Chapter, the main results from this Thesis are summarised along with the 
main conclusions. A brief perspective on the possible future development of this 
work, and microlensing investigations in general, is given. 



6.2 Summary of Thesis 
6.2.1 The Project 

The Angstrom Project is an international collaboration which has gathered four 
seasons of data (plus a pilot season), mainly using the robotic telescopes the 
Liverpool Telescope and Faulkes North telescope, together with the BOAO 1.8m 
and Maidanak 1.5m telescopes from a survey of the central bulge region of the 
Andromeda galaxy, M31. Difference ima ging has been applied to these data, us- 



ing a modified version of the ISIS code ( lAlard and Luptonl . Il998l ). and variable 



sources identified. The resulting variable object lightcurves, augmented by ear- 
lier data from the POINT- AG APE microlensing survey (re-analysed by us), have 
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been filtered using an automated candidate selection program which divides the 
lightcurves into microlensing candidates, variable star candidates, or no signifi- 
cant variation, and also classifies candidates according to signal to noise param- 
eters. The lightcurves are also subjected to r eal-tim e analysis by the Angstrom 



Project Alert System (APAS) (IDarnlev et all l2007ah . which is designed to detect 



fast-changing transients, with the aim of allowing immediate follow-up by other 
telescopes while the event is still in progress. 



6.2.2 Summary of Thesis 

Early on in the study period, the common overlap region of image data from our 
pilot season was analysed, at a time when, due to a telescope fault, LT images 
had random rotations and translations. This study discovered that the area rep- 
resenting the overlap of all images was less than 50% of the area of the Angstrom 
field. This fact informed a dec ision that it would be worthwhile altering the 



"ISIS" flAlard and Lupton 



19981 ) DIA code into what became "AngstromlSIS" . 



By an analysis of comparable small regions on images from the two telescopes it 
was found that the flux ratio between LT and FTN in i'-band data was 1.081 and 
in r band data was 0.305. These numbers were used in the candidate selection 
routine to link together the fitted amplitudes of variable stars in order to reduce 
the number of fitted parameters. 

The variable star candidates produced by the selection pipeline which had both 
PA and LT data were analysed to investigate whether there was a constant or 
other stable relationship between the amplitudes of variability measured in the 
data from the two telescopes. No clear simple constant ratio between the flux 
amplitude of variability in PA and LT or FTN data was found, although there 
does appear to be limited range of possible values, as shown in Figure 14.101 This 
investigation was only performed on the data extant in 2007, so in the future it 
is intended to re-investigate the flux ratio issue as part of a general reanalysis of 
the variable stars in the data, using the most up to date lightcurves. If a general 
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rule could be found for the relationship between the flux amplitude ratio and the 
flux amplitude, then this would enable the use of fewer parameters in the fitting 
in the selection pipeline, which in turn would increase the speed and efficiency of 
the program. 

Throughout the duration of this work, the so-called "short event" ANG-06B- 
M31-01 was analysed and then re- analysed several times as data processing, par- 
ticularly DIA, improved and new data added to the lightcurves. This analysis 
was particularly aided by data from the Maidanak telescope which fortuitously 
covered the span of the main flux spike. An attempt was made at estimating the 
likely mass of the lens star for this event (assuming it was caused by microlens- 
ing). Due to our lack of knowledge of the type and hence initial flux of the source 
star and of the relative source-lens velocity, and the small number of data points 
covering the flux peak, which is in turn due to the short duration of the event, an 
accurate estimate was not possible. However, the results indicate that the most 
likely type of star for the lens is a giant star. 

The most time-consuming aspect of all the work performed was the writing, 
developing and testing of an automatic candidate selection pipeline, and the 
application of this pipeline to the selection of both variable star and microlensing 
candidates, both after the third full season 2007 of Angstrom data and again 
mid-way through the fourth season, 2008. After the 2007 selection pipeline run, 
the results were further analysed to investigate the spatial, variable period and 
skewness distributions of the variable star candidates. 

A brief investigation was also made of the properties of the candidates selected 
from the 2008 lightcurve data, but, due to their low number, few firm conclusions 
may yet be drawn from this. Further investigation using Monte Carlo modelling 
techniques will be required to illuminate the results and allow more definite con- 
clusions to be drawn, for example about the distribution of matter in the M31 
bulge. 
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6.2.3 Conclusions 
ANG-06B-M31-01 

Initially, when the event was first detected, the fit to the main flux spike was not 
sufficiently convincing to definitively classify this event as due to microlensing. 
With the latest data, it now appears that the fit to a lensing lightcurve is much 
improved and hence the event is almost certain to be microlensing. The profile 
does not fit any other known phenomena and is a reasonable fit to a lensing 
lightcurve although the x 2 /d.o.f. of the fit, either for the whole lightcurve, or 
more particularly in the peak region, is not as good as would ideally be desired. 
Some possible reasons for this are described in Section 15.3.71 

The event was modelled using a combined reduced Paczyhski fit plus a variable 
component. The main parameters of the fit to this event are approximately 
summarised below: 

£fwhm : 2.1 ± 0.8 days, taking into account the fitting using the Maidanak data, 
or 1.36 ±0.1 days using only the most recent photometry. If confirmed, this 
timescale would be the shortest known microlensing event in M31. 

to (JD-2400000.5) = 53993.43 ± 0.01 

Period of variable: P = 245.15 ± 0.14 days 

Phase of variable (JD-2400000.5) : O = 54131 ± 1 

The variable was almost exactly described by a perfect sinusoidal function, 
peak i-band magnitude estimate: i = +18.07 ± 0.06 
central estimate of /3 = 0.015 ± 0.015 

estimate of using the above value of (3: tE = 41 ± +44 days. 

During the fitting process, when data from the Maidanak telescope were added, 
the best fit to a lensing lightcurve was obtained when the flux scaling factor 
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between Maidanak and LT data was found to be equal to i^Maid/^LT — 1 11- This 
is the first estimate (although still very approximate) that has been made of the 
scaling between the fluxes from these two telescopes. 

In the mass estimate described above, the most likely combination of lens and 
source locations was assumed to be both in the bulge. Taking this estimate as 
a guide, the most likely central mass for the lens star is 7.5 ± 21M . This is 
formally consistent with zero, but the mass cannot be less than zero. So to first 
order, we can only conclude that the lens mass is likely to be less than 28.5M 
and the mass of the source star is very likely to be M & > 1.44M , which is a 
limit derived by working backwards from the calculated peak flux magnitude and 
maximum possible magnification. 

Other brief duration lensing candidates have been found in M31 (for example, 



Auriere et al 



(|200l|)), but ANG-06B-M31-01 is certainly one of the shortest du- 



ration events so far seen in M31, and may be the shortest. 



6.2.4 Lensing candidates 

The 20 lensing candidates selected by the candidate selection pipeline and pre- 
sented in Section 15.5.21 clearly span a range of "qualities" , although they all 
pass all twelve cuts. On the whole, although some of the lightcurves do appear 
noisy, it is felt that they all have been correctly identified as containing a clear 
flux anomaly which is consistent with microlensing. Whether these lightcurves 
actually were caused by microlensing events will probably never be known. Con- 
firmation of whether a lensing event actually occurred, after the event, can only 
really currently be ac hieved in M31 throug h observations with the HST, as was 



done, for example, by lAuriere et al.l (120011 ). The spatial distribution of the lens 



ing candidates is presently inconclusive. It is presented in Figure 15.381 as a matter 
of record. Of more interest, perhaps, at this stage is the histogram of lensing can- 
didate timescales which does already seem to contain a certain amount of useful 
information. In the Milky Way, a typical lensing event is expected to last ~ 30 
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days, but in this survey, half the events lie at £fwhm < 15 days. The Angstrom 
Survey has been specifically designed to have a high cadence of data collecting 
with a view to being better able to find short events exactly within this time 
range. It should also be noted that the lowest timescale found in the main sam- 
ple (not including the "short event") was 3 days. Discovering the significance 
of the distribution of timescales found so far will have to wait until Monte Carlo 
modelling is performed using the same candidate selection pipeline. When this 
is complete, we will know better whether the timescale distribution presented 
here in Table 15.191 correctly reflects the actual distribution of event timescales 
or whether some aspects are affected by selection or systematic effects cause by 
the pipeline and/or the way the data have been collected or analysed. The fact 
that all microlensing collaborations seem to design their own systems of cuts to 
perform selection, no two of which are identical, makes the process of compari- 
son of results much harder. I feel that some kind of standardisation of methods 
would be very useful. The way that the Angstrom candidate selection pipeline 
has been designed, to systematically include variable baselines from the start, is 
a novel method. Undoubtedly this has also made filtering out "inappropriate" 
lightcurves and fits more complicated, and the process of experimentation with 
improving the way the cuts work should not be seen as a completed, but it does 
seem from the experience gained so far that the goal is an achievable one, though 
difficult. 

Testing of candidate selection pipeline using fake lightcurves 

A simple test of the candidate selection pipeline was carried out, searching for a 
variety of fake Paczyhski curves, having a range of peak amplitudes, timescales 
and randomised to' s > inserted both into a basically flat lightcurve and into one of 
the "real" variable object lightcurves (lightcurve 211). This "realistic" baseline 
was selected as neither having a previously identified lensing candidate within it, 
nor a large overall variation from a flat lightcurve. The efficiency of the pipeline 
for recovery of the inserted events was 20.7% using a flat baseline and 8.7% using 
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the lightcurve 211 baseline. These figures should not be interpreted as the true 
detection efficiencies due to the unrealistic parameter distributions used in this 
test, but they do at least show that a significant percentage of real events are 
recovered, even in the case of a noisy baseline. The original parameters of the 
inserted lensing peaks which were passed by the selection process were recov- 
ered within a few percent, except in the case of the lensing timescale using the 
lightcurve 211 baseline, which was only recovered within about 20% on average. 
Perhaps significantly, of the fake lightcurves, only two with timescales of 5 days 
were recovered, and none were recovered when using the lightcurve 211 baseline. 
This confirms that our completeness to these short events is still low, despite our 
aim of making our survey sensitive to shorter events. When the real lightcurves 
were searched, half the events found had timescales < 15 days, and, if this group 
is subdivided into 5 day intervals, the interval 15 — 10 days has 4 events, 10 — 5 
days also has 4 events, and 5 — days has 2 events. So a fall off of the numbers of 
events detected is apparently seen below 5 days, but the statistics are not good 
enough to be sure with this time resolution. The pure numbers of events detected 
do not give any real information about the detection efficiency, however, as this 
depends on the predicted timescale distribution of actual microlensing events. 
In general, the number of real events are expected to rise at the low timescale 
end, due to the large predicted number of low mass objects, e.g. brown dwarfs, 
expected to exist. If this is correct then the fall off in detection efficiency at 
low timescale would be even steeper than shown by the timescale distribution 
of recovered events. It would of course be possible to increase the number of 
low timescale events detected, for example by changing the stipulations of Cut 
6 (see Section |4T5|) . but this would be at the cost of decreasing the accuracy of 
the parameters recovered from some of the events, especially the peak flux (and 
hence the magnification). There is no easy answer to the question of where to 
draw the line in this situation- there will inevitably be compromises between the 
numbers of events detected and their security and the quality of their recovered 
parameters. The fairly low numbers of events recovered at low timescales (e.g. 
less than 5 days) from the real lightcurves (although representing 10% of the 
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total found events) are a little disappointing, and further work is clearly required 
to make improvements in this area. As data quality generated from the ADAP 
improves with further development this will help as it is especially hard to find 
short events within noisy lightcurves. 

6.2.5 Variable distributions 

Several investigations of the lightcurves classified by the selection pipeline as 
"variable", which were good fits to a skew sinusoidal variable star model, were 
carried out, to discover what useful information could be extracted from the data. 
These included investigations of the reduced x 2 ; period and skewness distribu- 
tions, along with cross-correlations between these two quantities and the with flux 
amplitudes of the variables, and investigations of the spatial distribution of the 
variable candidates. The results of this final investigation give much greater con- 
fidence that the variable candidates selected do belong to a physical population 
as their distribution has been shown to be highly non random in a self-consistent 
way. 

In order to get better statistics, the pure variable fits were augmented by adding 
mixed fits in which the lensing component was too small to pass the criterion 
for selection as a lensing candidate. In addition, four separate pipeline runs were 
conducted using different, limits on the ratio between the fitted LT and PA flux 
amplitudes, since this was not yet well known. One of these runs allowed the ratio 
to "freely" vary (within wide limits), but in the others the ratio was fixed. Period 
and skewness distributions for the 16 groups described above were investigated 
separately to check that they did not differ wildly between the categories and to 
ensure that it was therefore acceptable to combine the data into one larger group. 
Two examples of these plots may be seen in Section 15.5.11 When all categories 
were combined, the resulting data set contained 3160 lightcurves, of which 80.6% 
had reduced \ 2 < 5. The correlation between period and variable flux amplitude 
was investigated, and a tight relationship was found in which the mean period 
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at a given flux amplitude increased with flux amplitude. This relationship was 
fitted with a line whose equation was given in Equation 15.101 Lower periods, 
below about 80 days, did not fit this line well, and tended to bend down steeply 
towards zero period without much change in flux amplitude. It is thought that 
this might be due to the emergence of aliasing at shorter periods. When the 
data were transformed in order to remove this correlation, the "true" period 
distribution (at each constant flux amplitude) then emerged. This is shown in 
Figure 15". 2 II No significant correlation was found between the period and skewness 
distributions. 

The next investigation made was of the spatial distribution of the 2546 better 
fitted (reduced \ 2 < 5) variable and mixed candidates, also subdivided by flux 
amplitude. The total distribution, using all the candidates, was clearly peaked 
towards the centre of the galaxy, as would be expected given the large increase 
in the density of stars as the middle of the galaxy is approached. Some reduction 
of the detected spatial number density might have been anticipated due to the 
increased difficulty in detecting variability against the increased Poisson noise 
caused by the large increase in background light from the centre of the galaxy, 
and indeed subdivision of the spatial distribution into bins by variable flux am- 
plitude clearly showed that as the flux amplitude decreased, the central regions 
became relatively more sparse. This demonstrates the way the incompleteness 
for variable stars varied with flux amplitude. The cross-over point between the 
spatial distribution being centrally peaked and centrally troughed was found to 
be at around 4 — 5 ADU/s. The radial shape of the distribution appeared to 
be elliptical, with a ratio of semi-axis lengths o x jo y ~ 2 and this was initially 
confirmed by plotting a contour plot, Figure l5.25[ which showed that the ellipse 
was oriented approximately horizontally in the LT field, rather than either in 
the same d irection as t he M31 disk or the lower angle anticipated from previous 
modelling ( Stark . 1977I ). A more detailed investigation of the spatial distribution 



of the objects selected as variable star candidates was also performed in Section 
15.5.11 The first part of the investigation used radial and azimuthal binning of the 
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variable star candidates to investigate the azimuthal distribution as a function 
of radius. When the derived data points were plotted and a constant line fitted, 
the best fitting angle at which the azimuthal number distributions peaks was 
161.3°(±1.61°fitting error), or, alternatively, —18.7°. 

The second part of the investigation was of the purely radial distribution of 
variable star candidates. Dividing the area once again into concentric annuli of 
equal area, and adjusting for areas which had incomplete data due to the edge 
of the field, the dependence of the number density with radius was plotted. This 
could be fitted reasonably with a de Vaucouleurs Law within the uncertainties 
caused by the fairly low statistics. The number density in the central region was 
found to decline roughly linearly inside r = 20" which may be explained both 
by the true incompleteness of the survey due to increasing photon noise from 
the galaxy and by the systematic affect of masking out the very central region 
in the difference imaging. By subtracting this linear fit from the de Vaucouleurs 
fit above, the functional form of the completeness in the central region could be 
estimated if the (large) assumption was made of this function being equal to 1 at 
(r= 20"). This was done, and was shown in Figure I5TTH 

6.3 Future Work 
6.3.1 Known Problems 

The annual gaps in temporal coverage due to Andromeda being below horizon in 
the Northern Hemisphere are really troublesome. The duration of the off season 
during which Andromeda is too close to the sun to observe is annoyingly similar 
to the period of variability of some variable stars, and this leads to a high chance 
of aliasing in the fitting, as shown by Figure 15.171 In turn, lack of unambigu- 
ous knowledge of the baseline leads to serious doubt as to whether the fitted 
microlensing component is just that or rather some aspect of the variable star 
that was insufficiently covered by the data on previous or subsequent occurrences. 
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There seems to be no way to improve the data coverage problem, however, with- 
out expensive solutions such as a telescope in Earth orbit diametrically opposite 
the Earth. 

The combined (Pac.+ var) fitting function is very flexible and does not always 
do what would be hoped for- it has the freedom to use the microlensing part 
to model long low bumps which may have a moderate area under bump which 
are probably due to stellar variability and may leave some sharp spikes (low 
or area underneath curve), which are likely to be due to microlensing, unfitted. 
In this work the areas of the lightcurve in which the Paczyhski curve may be 
fitted have remained unlimited, but is a possibility worth considering to limit 
range of fitted t to those areas of the lightcurve where /<&/ is concentrated? This 
would usually require the pre-fitting of a variable star fitting function, then a 
re-assessment of those areas of the lightcurve which are not a good fit to this. 
This could be done within the same iteration. 

There is a remaining issue with coherent groups of deviant points in time, where 
the summed deviations are not sufficient to make x 2 /d.o.f. greater than a reason- 
able threshold. Although this issue has been partially addressed by Cut 12, a few 
lightcurves with secondary deviations which are low enough to escape Cut 12 but 
still noticeable by eye may be seen in the final sample. Since the modelling applied 
in this work only allows one lensing curve to be fitted to the lightcurve, the exis- 
tence of unexplained lightcurve bumps cast doubt on the veracity/appropriateness 
of the model, despite passing the cuts. x 2 /d.o.f. is certainly not a sufficient cri- 
terion to distinguish "good" lightcurves, but it is hard to know where to draw 
the line as regards complicating the models to fit more complicated lightcurves. 
Possible examples of this might be modelling binary lensing events, or perhaps 
even a nova plus a lensing event plus a variable star in the same lightcurve. 
Given sufficient lightcurves, these relatively rare collections of deviant points will 
occur anyway by chance, but the level of Cut 12 is set so that much fewer than 
one of the cut deviations would be expected in the 93240 lightcurves analysed. 
However, there may be some variable stars which have more "spiky" lightcurves 
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than are currently modelled with the cosinusoid function, and these may possibly 
be mistaken for an additional lensing spike in the lightcurve given sufficiently 
unfortunate time sampling. 

Currently, the parameters in the DIA photometry pipeline are set to favour the 
detection of as many variable events as possible, at the expense of re-detecting 
some events many times. This conscious choice leads to a larger number of repeat 
and near-repeat lightcurves. It is possible that the balance between sensitivity 
to detection and robustness against finding repeat lightcurves may need to be 
examined again in the future, and perhaps a different compromise found. 

A very important area in this work is the error estimation on data points, espe- 
cially if we are relying on \ 2 i n the peak region to select lightcurves. In the 2007 
photometry, based on analysis of the "short event", ANG-06B-M31-01 and also 
the analysis of the x 2 /d.o.f. distribution of the variable star candidates, it seems 
that either the errors were underestimated or there is unexplained non-random 
variation in the lightcurves. In the event ANG-06B-M31-01, upward rescaling of 
the error bars in the lightcurve was found to be required to make the \ 2 /d.o.i. 
~ 1.0, and in the case of the variable star candidates analysed in Section r5.5.1[ 
a similar re-scaling of order V2.0 was apparently implied by the distribution of 
Y 2 /d.o.f. values f ound in the fitting (see Figure EHE]) • It has since been discovered 



by 



Kerins et al 



(120091 ) that indeed our data are not photon noise limited, and 
that our % 2 /d.o.f. should be adjusted downwards by a factor of (1.4 2 = 1.96) if 
they are to be compared with photon noise limited x 2 /d.o.f.'s which are ~ 1 if a 
good fit is achieved. 



6.3.2 Suggested future extensions 

It might be desirable at some point to add more variable star templates, for 
example, fitting functions that replicate "saw-tooth" Cepheids or variables with 
"M" shaped lightcurves, for instance RV Tauri stars or some eclipsing binaries. 
This would enable investigation of the relative fractions of the different classes 
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of variable stars in the totality of variable stars in M31 and help us learn more 
about their physical properties, for e.g. period and skewness. The current si- 
nusoidal model acts as a promising "proof of principle" and shows that fitting 
model lightcurves can provide real information (over and above that obtained 
from periodicity analysis) about variable stars in M31. 

It would also be possible to add a routine for more clearly distinguishing classical 
novae which can sometimes appear to have similar lightcurves to microlensing, 
especially given unfortunate temporal sampling of the data. In general, nova 
lightcurves rise more steeply than they fall, and this asymmetry could be mod- 
elled. Currently when a Paczyhski curve is fitted to what may be a nova lightcurve 
we usually expect an flux undershoot before the peak and an overshoot after peak. 
This may also be tested for. 

A small but important change which has not yet been made would be to introduce 
divisions between variable signal to noise categories which would have a logarith- 
mic spacing. If this was done the spacing of the categories would get smaller 
when the signal to noise gets lower. This would be useful as there are many more 
of these lower signal to noise lightcurves than the higher signal to noise ones and 
with better resolution it would be possible to get better information about the 
signal to noise distribution. 

At some point it would be useful to re- visit the work done on variable stars using 
the 2007 photometry, using the most up to date 2008 photometry. If similar 
overall conclusions may also be drawn from a different data set, then this would 
strengthen the confidence in the results. Alternatively, if the conclusions so far 
drawn are not supported, the reasons why not would have to be investigated. 

I believe my development of, and experimentation with, the "Quality Factor" 
indicator of the quality of lensing candidate may point to a different way of 
thinking about candidate selection. Often a candidate which is quite impressive 
by eye will fail at least one of a set of rigid cuts. Therefore, a system of cuts 
calibrated so that only excellent candidates pass all the cuts will throw away 
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many good candidates which are nevertheless worthy of further investigation. In 
the regime where Angstrom sits, with significant amounts of stellar blending from 
variable stars, this is especially true. Often the candidates that are thrown away 
because they failed a particular cut will have values for the other cut parameters 
which are even better than the candidates which pass all the cuts. Clearly it is 
not possible to go all the way down the path of using the accumulated total or 
product of all the cut parameters as the value upon which selection is based, as in 
that case one extremely good cut parameter could outweigh all the others. There 
must clearly remain some minimum standards. However, in this work it has been 
experimented with keeping these as low as possible without being unreasonable, 
and then using more "fuzzy" methods such as the Quality Factor to decide the 
order of priority among the remaining candidates. Another line of thought which 
follows on from this might be that candidate selection would be an ideal task for 
neural networks, which are very good at producing quick answers to the analysis 
of large quantities of data, once properly trained. This would be a way of getting 
away from the inevitable involvement of a human being who fairly arbitrarily 
decides what cut levels to set and whether a particular candidate is "acceptable" , 
or "good", which are themselves fuzzy concepts. It seems that in the "black 
art" of microlensing candidate selection there is no agreed set of cuts to use, and 
hence each collaboration uses their own magic formula to select events. In some 
cases, for e.g. POINT AGAPE, the disagreement over what cuts to use is so 
severe that the collaboration splits into two or even three camps which do their 
own independent candidate selection. It would be useful if all M31 collaborations 
decided a universally agreed set of cuts. This would mean that one could compare 
like with like, and say on a standard basis whether one collaboration had actually 
found more events than another, even without having to do all the subsequent 
Monte Carlo simulations, which are in themselves collaboration-dependent. 

It may be beneficial to check for lightcurve copies before running the main 
pipeline. Previously, such as in the 2007 photometry, this was not such a big 
issue as repeated events were only a small fraction of the total. In the 2008 pho- 
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tometry, however, the fraction of lightcurves selected by the "loose" cuts which 
were a copy, either exact or very similar, to another lightcurve was nearer 50%. 
Some photometric objects were found to have up to 10 lightcurves associated 
with them. This resulted in an approximate doubling of the running time for 
the pipeline. This in turn implies it may be now be worthwhile developing an 
algorithm to check for repeat or near repeat events which would be run before 
the main pipeline. Only the lightcurve with the most data points in the group 
would be kept, and then it might be possible to make a further choice based on 
the mean error on the flux points in the lightcurve. 



6.4 Future of Microlensing investigations 



Gouldl (ll995[ ) performed calculations suggesting that attempting a microlensing 
search towards M87 and the Virgo cluster in general would be technically possible 
(currently it would be necessary to use the Wide Field and Planetary Camera 
on the Hubble Space Telescope). It should be feasible either to search for high 
magnification "spikes" within M87 itself or to search in the intervals b etween the 
cluster galaxies for "Intra-cluster MACHO's" (ICMs) ioould . 1995). No team 
has so far successf ully proposed this experiment. For the even farther future, 
the Coma Cluster (I Gouldl . Il996l ) is an even richer cluster that would be even 
more interesting in which to search for ICMs, but, being about 5 times further 
away than the Virgo Cluster, is beyond the capabilities of any current telescope. 
However, with the advent of new space telescopes with larger dish sizes, such as 
the James Webb Space telescope, successful microlensing observations at these 
amazing distances may even become possible. 



Another prospect for the future might be the possible observation of cosmic 
strings, if they exist. These are in theory infinitely thin objects with a finite 



length and a finite mass, which would cause gravitational lensing i 
moved between us and a stellar source. It has been proposed in 



they sat, or 



Sazhin et al. 



(120031 ) that a cosmic string has already been seen as the most likely cause of 
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a double galaxy. Because strings are predicted to cause an unmagnified pair of 
images when sitting directly in front of a source, the lightcurve would rise linearly 
to an exact factor of two amplification of the original flux and then return to the 
original value, if the string was straight and was moving fast enough relative to 
the source that the changing flux could be detected. This characteristic lightcurve 
and the fact that they should produce a clear double image with no detectable 
foreground galaxy lens at all make it conceivable that cosmic strings could be 
detected using "macro" lensing or even with microlensing. This would be a very 
important discovery for fundamental physics. 

A more immediate prospect for the near future might be the routine detection of 
planets in another galaxy. The most obvious possibility, other than the Magellanic 
Clouds, is M31, as the nearest massive gala xy to the Milky Way The possibility 



of th is has been fairly extens ively studied (Chung et al 



20071 ) . and, as pointed out by llngrosso et al. 



2006 



de Paolis et al. 



(120091 ). may already have been ob- 



served by the POINT - AGAPE survey as first d escribed as the binary lensing 



candidate PA-00-S5 in 



Calchi Novati et al. 



( 120051 ). The discovery of planets and 



planetary systems in our own Galaxy has become relatively routine in the last 
few years, with over two hundred planetary systems known, each containing up to 
five planets. It would be very exciting if this process of discovery could also begin 
in the next few years in M31. There seem few technical barriers to this, except 
for waiting for the perfect binary lightcurve with sufficient temporal sampling to 
come along. Towards this end, high cadence surveys such as Angstrom have a 
much better chance of being able to detect and model any planetary deviations 
successfully, so we look forward to the future with anticipation. 



Appendix A 
Glossary 



2MASS = Two Micron All Sky Survey 

AGAPE = Andromeda Galaxy and Amplified Pixels Experiment 

Angstrom = Andromeda Galaxy STellar Robotic Microlensing project 

AAS = Angstrom Alert System 

ADAP = Angstrom Data Analysis Pipeline 

BSQF = Bump Sample Quality Factor 

CCD = Charge Coupled Device 

CDR = x 2 Difference Ratio 

Colour-Magnitude Diagram = CMD 

Columbia/ VATT = Colombia/ Vatican Advanced Technology Telescope 
DIA = Difference Image Analysis 

EROS = Experience pour la Recherche d'Objets Sombres 
FTN = Faulkes Telescope North 
GLRG = Giant Luminous Red Galaxy 
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HST = Hubble Space Telescope 
HWHM = Half- Width Half Maximum 

ICM = Infra-Cluster MACHO's (see definition of "MACHO" below) 

IMF = Initial Mass Function 

JWST = James Webb Space Telescope 

LC = lightcurve 

LMC = Large Magellanic Cloud 

LSN = Lensing Signal to Noise factor 

LT = Liverpool Telescope 

MACHO = MAssive Compact Halo Object 

MDM = Michigan Dartmouth M.I.T. 

MS = Main Sequence 

MEGA = Microlensing Exploration of the Galaxy and Andromeda survey 
NainiTal = Astronomical survey; also a town in India 
OGLE = Optical Gravitational Lensing Experiment 
PA = POINT AGAPE survey of M31 

POINT AGAPE = Pixel-lensing Observations with the Isaac Newton Telescope, 
Andromeda Galaxy Amplified Pixels Experiment 

PSF = Point Spread Function 

QF = Quality Factor 

RATCam = Robotic Automated Telescope CAMera (on LT) 
RGB = Red Giant Branch 
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SDSS = Sloan Digital Sky Survey 

SLOTT AGAPE = Systematic Lensing Observation at Toppo Telescope An- 
dromeda Galaxy and Amplified Pixels Experiment 

SMC = Small Magellanic Cloud 

SupIRCam = SUPERNova InfraRed Camera (on LT) 

WeCaPP = Wedelstein Calar alto Pixellensing Project 

WFPC2 = Wide Field and Planetary Camera II (an instrument on the Hubble 
Space telescope) 
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